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PREFACE 


Mathematicians have always been occupied with questions of 
maxima and minima. With Euclid one of the simplest problems 
of this character was: Find the shortest line which may be drawn 
from a point to a line, and in the fifth book of the conics of 
Apollonius of Perga occur such problems as the determination. 
of the shortest line which may be drawn from a point to a given 
conte section. 

It is thus seen that a sort of theory of maxima and minima 
was known long before the discovery of the differential calculus, 
and it may be shown that the attempts to develop this theory 
exercised considerable influence upon the discovery of the cal- 
culus. Fermat, for example, after making numerous restorations 
of two books of Apollonius, often cites this old geometer in his 
«method for determining maximum and minimum,” 1638, a work 
which in some instances is so closely related to the calculus 
that Lagrange, Laplace, Fourier, and others wished to consider 
Fermat as the discoverer of the calculus. This he probably would 
have been had he started from a somewhat more general point . 
of view, as in fact was done by Newton (Opuscula Newtont, J, 
86-88). 

Maclaurin (A Treatise of Fluxions, Vol. I, p. 214. 1742), wrote: 
«There are hardly any speculations in geometry more useful or 
more entertaining than those which relate to maxima and minima. 
Amongst the various improvements that began to appear in the 
higher parts of geometry about a hundred years ago, Mr. de 
Fermat proposed a method for finding the maxima and minima. 
How the methods that were then invented for the mensuration 
of figures and drawing tangents to curves are comprehended 
and improved by the method of Fluxions, may be understood 


from what has already been demonstrated. A general way of 
iii 


iv THEORY OF MAXIMA AND MINIMA 


resolving questions concerning maxima and minima is also de- 
rived from it, that is so easy and expeditious in the most 
common cases, and is so successful when the question is of a 
higher degree, when the difficulty is greater and other methods 
fail us, that this is justly esteemed one of the most admirable 
applications of Fluxions.” 

The theory of maxima and minima was rapidly developed 
along the lines of the calculus after the discovery of the latter. 
Mathematicians were at first satisfied with finding the necessary 
conditions for the solution of the problem. These conditions, how- 
ever, are seldom at the same time sufficient. In order to decide this 
last point, the discovery of further algebraic means was necessary. 
Descartes had already remarked, in a letter of March 1, 1638, that 
Fermat’s rule for finding maxima and minima was imperfect; and 
we shall see that many imperfections still existed for a long time 
after the invention of the calculus by Newton. 

As introductory to a course of lectures on the calculus of 
variations, I have for a number of years given a brief outline 
of the theory of maxima and minima. This outline is founded 
on the lectures that were presented by the late Professor 
Weierstrass in the University of Berlin. It treats the ordinary 
cases; that is, where the functions are everywhere regular and 
where the forms are either definite or indefinite. It was published 
as a bulletin of the University of Cincinnati in 1903. At that 
time I expected to publish another bulletin which was to treat 
the more special cases; for example, where only one-sided differ- 
entiation enters, the “ambiguous case,” where the form is semi- 
definite, etc. A treatment of these cases, the extraordinary cases, 
required more study than was anticipated. The bulletin has 
consequently been delayed so long that I have concluded to give 
an entirely new exposition of the whole theory. 

In the preface to the German translation by Bohlmann and 
Schepp of Peano’s Calcolo differenziale e principii di calcolo 
integrale, Professor A. Mayer writes that this book of Peano not 
only is a model of precise presentation and rigorous deduction, 
whose propitious influence has been unmistakably felt upon 
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almost every calculus that has appeared (in Germany) since that 
time (1884), but by calling attention to old and deeply rooted 
errors it has given an impulse to new and fruitful development. 

The important objection contained in this book (Nos. 133-136) 
showed unquestionably that the entire former theory of maxima 
and minima needed a thorough renovation; and in the main 
Peano’s book is the original source of the beautiful and to a 
great degree fundamental works of Scheeffer, Stolz, Victor v. 
Dantscher, and others, who have developed new and strenuous 
theories for extreme values of functions. Speaking for the 
Germans, Professor A. Mayer, in the introduction to the above- 
mentioned book, declares that there has been a long-felt need 
of a work which, for the first time, not only is free from mis- 
takes and inaccuracies that have been so long in vogue but 
which, besides, so incisively penetrates an important field that 
hitherto has been considered quite elementary. 

To a considerable degree these inaccuracies are due to one of 
the greatest of all mathematicians, Lagrange, and they have 
been diffused in the French school by Bertrand, Serret, and 
others. We further find that these mistakes are ever being 
repeated by English and American authors in the numerous 
new works which are constantly appearing on the calculus. 

It seems, therefore, very desirable in the present state of 
mathematical science in this country that more attention be 
given to the theory of maxima and minima; for it has a high 
interest as a topic of pure analysis and finds immediate appli- 
cation to almost every branch of mathematics. 

I have therefore prepared the present book for students who 
wish to take a more extended course in the calculus as intro- 
ductory to graduate work in mathematics. I do not believe in 
making university students study abstruse theories in foreign 
languages, and in this treatise it will be found that the peda- 
gogical side of the presentation is insisted upon; for example, 
the Taylor development in series is given under at least half a 


dozen different forms. 
HARRIS HANCOCK 
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THEORY OF MAXIMA AND MINIMA 


CHAPTER I 
FUNCTIONS OF ONE VARIABLE 
I. ORDINARY MAXIMA AND MINIMA 


1. A function /(«) which is uniquely defined for all values of 
x in the interval (a, b) is said* to have a greatest value or a maxi- 
mum for the value z=), situated between a and 8, if there is 
a positive quantity 6 such that for all values of h between — 6 
and + 6 the difference 

[1] SF (®+ h) —f(%) = 0 
exists, which at the same time does not vanish for all these 


values of h. This function has a smallest value or a minimum 
if under the same conditions there exists the difference 


[2] Ff (#y + h) — fF %o) = 9, 
which does not vanish for all values of h between — 6 and + 6. 

A function may have several such maxima and minima which 
may be different from one another; it may have minima which are 
greater than maxima. (See 
the accompanying figure.) 
When the existence of com- 
plete derivatives in the 
entire interval under con- 
sideration is presupposed, 
the maxima and minima 
which may be derived are called ordinary; but when we have to do 
with functions whose derivatives exist only on definite points or with 
functions which have one-sided derivatives and the like, the maxima 
and minima may be called extraordinary. The discussion will be 


Fre. 1 


* See Genocchi-Peano, Calcolo differenziale e principii di calcolo integrale (§ 131). 
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restricted at first to ordinary maxima and minima. A maximum of 

f(a) is called proper by Stolz (Grundziige der Differential- und Inte- 

gralrechnung, Vol. I, p. 199) if in the formula [1] only the sign < 

stands ; while we have a proper minimum if there stands only the 

sign >in [2]. The maximum and minimum are improper if in formu- 

las [1] and [2] the sign =also appears, however small 6 may be taken. 
2 


1 
For example, y =(«sin—} has the value + 0 when z= — for 
a) nT 


consecutive integral values of , however large, and that is for 
intervals as small as we wish. Stolz and others * use the notation 
extreme or extreme value of a function to denote indifferently 
either the maximum or minimum of the function. 

The maximum and minimum of a function defined as above 
are often denoted as absolutet maximum and minimum, since 
they depend upon the collectivity of the values of f(z). Opposed 
to them appear the relative maximum or minimum, which enter 
if the independent variable x is subjected to a restriction so that 
A in the formulas [1] and [2] can take only restricted (and not 
arbitrary) positive and negative values. 

2. If the function /(«) has for =, a positive derivative 
F'(), the function is becoming larger on this position with in- 
creasing x, and its values are respectively smaller or greater than 
those of f(z) according as # is smaller or greater than 2. It is 
assumed that x lies sufficiently near 2p. 

In this case the function /(«) has for # = x, neither a maximum 
nora minimum. Similar (mutatis mutandis) conclusions are drawn 
if f'(a) is negative. 

It follows that if the function f(x) has for x= x, a finite de- 
rivative that is different from zero, then on this position the function 
has neither a maximum nor a minimum. 

If then we exclude from the values of x those to which a defi- 
nite derivative (different from zero) corresponds, there remain either 


* Eutremer Werth was used by R. Baltzer, Elem. d. Math., Bd. I, Aufl. 5, S. 217: 
Extremum by P. du Bois-Reymond, Math. Ann., Vol. XV, p. 564. ' 
t The authors just cited, as also Peano, understand by the absolute maximum and 
minimum of a function in a given interval the upper and lower limits of the function in 
this interval, if such limits are reached. Seealso A. Mayer, Leipz. Ber.(1899), p.122, and 
Lipschitz, Analysis, Vol. II, p.306, and in particular Voss, Encyklopédie der Math. Wiss 
Bd. Il, Theil I, Heft I, 8. 80, who remarks upon the weak terminology of the subject, 
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those positions on which the function has no derivative (finite or 
infinite) or those places on which it has a vanishing derivative. 

These positions must be further examined if we wish to make 
ourselves sure of the existence or nonexistence of a maximum or 
minimum. No rule can be given for the cases where derivatives 
do not exist. 

If we assume that the derivative is zero, the following criteria 
may be used: If f(x) has the derivative f’ (z) in the interval 
(2) —h+++ +h), we have, in virtue of the Taylor formula,* 

SF (#) — f(y) = (@ — po) f" (a), 
where z, lies between x) and x. If f(x) becomes zero on the posi- 
tion # = 2 in such a way that it is positive for <x, and negative 
for «> x), then (« — x) f"(#,) is always negative, however «(# 2) 
be taken, and consequently it follows that /(x) < /(a,) for all values 
of x within the interval «,—h to «+h. The function will there- 
fore be in this case a proper maximum for «=2,. If, however, 
J'(#) is negative for <<, and positive for >a), the product 
(a — a) f'(x,) is always positive, and the function will therefore 
be a proper minimum for «=, within the interval in question. 

If /'(z) is zero, say, for values of # within the interval z --- @+h 
or within the interval ~,—--- %), we have cases of improper ex- 
tremes (maxima or minima). But if /’() retains a constant sign 
in the meighborhood of ~=.«,, then («—))/f"(a#,) changes its 
sign according as «>a, or #<,,and the function has neither a 
maximum nor a minimum. 

It is thus seen that the function f(x) has on the position x = ap 
a maximum or a minimum according to the manner in which 
its derivative vanishes for «=a; that is, according as we pass 
from positive to negative values or from negative to positive values 
with increasing x. It has neither a maximum nor a minimum 
if the derivative does not change its sign.t 


* See Pierpont, The Theory of Functions of Real Variables, Vol. I, p. 248. 

+ Leibniz, Vol. V, pp. 220-226, is the first who made a distinction between maximum 
and minimum. See Maclaurin, 4 Treatise of Fluxions (1742), Vol. I, any, ix, and 
Vol. Il, p. 695; and also Cauchy, Cale. différ., p. 63. With Leibniz, when = ='0, 7 is 
a maximum if the curve is concave towards the x-axis, a minimum if the curve is 
concaye away from the x-axis. 
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3. Instead of considering the sign of the derivative in the 
neighborhood of , if we consider the sign of the second deriva- 
tive for «= a, (when this second derivative is different from zero), 
we have the rule: 

The function f (x) has on the position x= x, for which f' (a) = 9 
a maximum or a minimum according as f''(%) ts negatwe or 
positive. Infinite values are always included wnless it 1s stated 
to the contrary. 


In fact, if f’"(x))<0, then f(z) is a decreasing function, and 
since it is zero for z = a, it goes from positive to negative values ; 
the inverse is the case if /’’(x))>0. 

This rule leaves one in the lurch if /’"(z») = 0. 

If in general it is assumed that 


F'(@y= 9, fF (Gq)=9, 22+, FO PME)=9, SOG) * O, 
it follows from Taylor’s formula that 


F (0) — f(%) =" foray), 

where «, is situated between x, and z. 

As here f(x) is assumed to be a continuous function, it retains 
a constant sign in the neighborhood of x). If is odd, the factor 
(w— x)" changes sign according as x >a, or <a). Consequently 
F(&) — f(%) also changes its sign and f(#,) is neither a maximum 
nor a minimum. If m is even, the factor (w— ,)” is positive and 
iF (x) —f (0) has always the sign of f™(a,). It follows that f(z») 
is @ Maximum or minimum according as f(z) is negative or 
positive. We therefore have the theorem: * 


If for «=x, the first and some of the following derivatives 
vanish, then f(a) 1s or is not an extreme according as the first 
nonvanishing derwative is of even or odd order. If it is of even 
order, there is a maximum or a minimum according as the 
derivative in question is negative or positive. 


* See Maclaurin, A Treatise of Fluxions, Vol. I, p. 226; Vol. Il, p. 695; and also 


Lagrange, uvres, Vol. I (1759), p. 4. It was Maclaurin who fi 
method of distinguishing maxima from minima. Yee Oa 


\ 
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4. The following may be regarded as a résumé of what has 
been given above: The function /(x) is supposed to be uniquely 
defined for all values of « within an interval (a, b), and a, is a 
point of this interval. The function /(x) is a proper maximum 
or a proper minimum for «= %, if we are able to find a positive 
number 6 sufficiently small that the difference f(2)+h)—f (xp) 
retains a constant sign when / varies from —& to +6. If this dif- 
ference is positive, the function /(”) is smaller for «=, than it is 
for the values of x neighboring 2); it is then a proper minimum. 
On the contrary, when the difference /(7,+h)— f(x) is negative, 
the function is a proper maximum for «=a. Hf, furthermore, the 
sign =enters in the cases just mentioned, however small 6 be 
taken, we have an improper minimum or maximum. 

When the function /(~) admits a derivative for the value a, of 
the variable, this derivative must be zero. In fact, the two quotients 
fle th)—F lm) f(a — 2) — F(2) 

h —h 
which have here by hypothesis the same limit /’(x)), when / tends 
towards zero, are of different sign; it is necessary then that their 
common limit /’(x,) be zero. 

Inversely, let x, be a root of the equation /'(z)= 0, situated 
between a and 6, and taking the general case suppose that the 
first derivative which is not zero for v=, is the derivative of 
the nth order and that this derivative is continuous in the neigh- 
borhood of the value x). The general formula of Taylor gives 
here, limiting it to the term of the nth degree, 


Flo+)— fla) =F (e+ 62) (0<0<1) 


=" [£4] 


where ¢ is a quantity that is indefinitely small with h. Let 6 bea 
positive number such that as # varies from x)—6 to w+ 5 the 
absolute value of ¢ is smaller than /™ (x), so that /(%)+h)—f(%p) 


has the same sign as “ {™(x). Tf. is odd, we note that this dif- 


ference changes sign with h; there is then neither a maximum 
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nor a minimum for #=2,. If m is even, f (t+ h) —f(#) has the 
same sign as f)(x)) whether h be positive or negative ; the func- 
tion is a minimum if f™ (ap) is positive, and a maximum if f (a9) 
is negative. It follows that for the function to be a maximum or 
minimum for «=, it is necessary and sufficient that the first 
derivative which vanishes for x = 2, be of even order.* 

In geometric language the preceding conditions denote that the 
tangent to the curve y=/(z) at the point 4 is parallel to OX 
and is not an inflectional tangent (see Figs. 2-5). 


I A erg 155 


Il. EXTRAORDINARY MAXIMA AND MINIMA 


A. FUNCTIONS WHICH HAVE DERIVATIVES ONLY ON 
DEFINITE PosItIons 


5. Let the function y= /(z) be uniquely defined for all values 
of w between — 9 and 2+ 6 and suppose that it is continuous 
for “=. If the expressions 


ft N= Ft) nq Lea MS) © Gao) 


* 
° a eee Cours D’ Analyse, Vol. I, pp. 108 et seq. I shall refer hereafter to 
his work by the name of the axthor, and by Peano and Stolz I shall designate the 
works, cited above, of these two mathematicians. 
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have limiting values when lim i =+ 0, each of these expressions 
is called a one-sided differential quotient,* the first the right-hand, 
and the second the left-hand, differential quotient of f(x) with 
regard to « for the value z= a. 

If the two one-sided differential quotients of f(x) are equal to 
each other for «=, their common value is called the complete 
differential quotient of /(~) with respect to # for v= zp. 

If next it is assumed that the one-valued function f(x) is con- 
tinuous for all values of the interval (a, 6) and has at least one- 
sided differential quotients, the differential calculus offers a method 
for the determination of the maxima and minima. For if f(z) is 
such an extreme of f(x), the quotient 


S (+h) —S (®) 
h 


must necessarily either vanish or change sign with h. 

It may therefore be concluded, as in § 2, that the complete 
differential quotient f’(~) must be zero, and that if the right- 
hand and left-hand differential quotients of f(x) are different at 
the point «= 4), they cannot have the same sign. These require- 
ments are under the existing conditions necessary that /(x)) be 
an extreme of f(z); however, as it will be seen in the following, 
they are not always sufficient. 

6. Criteria as to whether a root «= x, of the equation f'(x)= 0 
offers an extreme of the function f(x).t 


TuEorEM I. If /’(x) vanishes for =), and if a positive 
quantity 6 may be so chosen that f(x) has complete differential 
quotients in the interval (v,—6---«)+ 6), and if f"(z) changes 
sign neither in the interval (c,—6---) nor in the interval 
(9 +++ %+ 6) and also does not remain invariably zero in either 
of these intervals, then f(x,) is or is not a proper extreme of 
f(#) according as the sign of f(x) in the first interval is different 
from or the same as it is in the second interval; and further- 
more, in the first case f(z) is a maximum or a minimum of. 


* See P. du Bois-Reymond, Math. Ann., Vol. XVI, p. 120; see also Pierpont, The 


Theory of Functions of Real Variables, Vol. 1, p. 223. 
+ See Cauchy, Calc. différ., Lesson 7, and see in particular Stolz, pp. 201-210. 
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f(x) according as f'(7) on its passage through zero, as « with 
increasing values passes through 2), changes from the sign + to 
the sign — or from the sign — to the sign +. 

This theorem is stated at the end of § 3 and there proved; and : 
as also indicated in that section, the inconvenience arising due 
to the consideration of the sign f’(z) may be obviated if the func- 
tion f(z) has a complete second differential quotient for x= 2). 
This leads to the following theorem: 


TuEorREM II. If under the conditions assumed in Theorem I 
the function f(z) has for «=, a complete second differential 
coefficient f’"(%)) which is not zero, then f(z ) is a proper ex- 
treme of f(z), being a maximum or minimum according as /’’(<») 
is negative or positive. ; 

Due to the definition of a complete second derivative it follows 
that ' pss 

f er’ mC) = f(a) + R(h), 
where #(h) is a quantity that becomes indefinitely small with h. 

If here the existence of a second derivative of f(x) is assumed 
only for «=a, then, since f’(x))= 0, there corresponds to every 
positive quantity « another quantity 6 such that, if —d<h<6, 


we have ; 
Fm) +> Tt s g(a) — 


If, say, f'(%) 18 positive, it follows at once that there must be 
a positive quantity 6 such that for —d<h<6 we have 


[3] of "(t+ h)> 0. 


Hence /"(v)+h) must be negative when h is negative and posi- 
tive when / is positive, so that on passing through zero (i.e., when 
&= &»), f'(% +h) passes with increasing « from a negative value 
to a positive value. Accordingly, in virtue of Theorem I, J (%q) is 
a proper minimum. 

The above theorem becomes the one given in § 3 if it is 
assumed that there is an interval including the value x =, such 
that for all points within it a second differential quotient of J (2) 
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exists, and if it is further assumed that f(z) is continuous at 
least at the point «=a. 

TuzorEm III. If, furthermore, f’(x,)= 0 and f!"(«,) + 0, then 
J (&p) is not an extreme of f(z). 

For since here 


Fi ly +h) =f") = SIMO) to 


it is seen that as /’(«) passes through the value /’(x,) = 0, it does 
not change sign, and consequently /(2,)) is not an extreme of f(z). 

The two preceding theorems are special cases of the two 
following: 


THEOREM IV. If for the value «=~ we have 

SG) = 9, FM p)=9, +22, SEED (H)=0, SO” (xp) # 0, 
then /(Z,) is a proper extreme of f(z), being a minimum or maxi- 
mum according as f@"(x,) is positive or negative. 

For, owing to the supposed existence of the first 2% differential 
quotients, there is an interval z)—6---#)+ 6 throughout which 
the differential quotients /’(x), f(x), «++, f@*-) (x) not only exist 
but are also continuous functions of x. Accordingly, in virtue of 


Taylor’s formula, we have 
p2k-l if 
[4] P+) LO) = Bait Y(a)+ 0h), (0<@<1) 


which formula is true for all values of h such that —d6<h<6. 
Owing to the existence of f@” (zp), as in the case of formula [3] 
above, it is seen that for values of # such that — 6<h<6 we have 
F289 (5+) >0 or <0 

according as f/f”) (a) is positive or negative. 

If, then, f@(«,) is, for example, positive, it is clear that 
f@ (a+ h) is negative for values of 4 in the interval —6--- 0 
and positive for values of / in the interval 0--- 6. 

It follows from [4] that the difference f(x) +) —/(,) for all 
values of 2 within the interval —6.--+6 (excepting h=0) is in- 
variably + or — according as f®")(a) is + or —, and correspond- 
ingly we have respectively a proper minimum or a proper maximum. 
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If it is further supposed that f@” («) exists for all values of x 
in the interval z— 8-+-- +6 and that f@” (a) is a continuous 
function at least at w=), then, as in § 3, due to Taylor’s 
expansion we have 


[] fot h) =F (%)= Bit eo M), (0<0<1) 


from which the theorem is obvious. 

In exactly the same way we may prove 

TuEorEM V. If for v=, the 2 first differential quotients of 
F(a), Viz., f'(a%o)s F"(&p)) «++ F2"(ao), vanish, and if f@**) (a) + 0, 
then /(z,) is not an extreme of /(z). 


Remark. In the case that «=z, causes every differential quotient of 
f(z) to vanish, we cannot determine by means of Theorems II, III, and 
IV whether f(x) is an extreme or not. We must then apply Theorem I. 

1 


For example, it is seen that x = 0 is a minimum of f(r) = ez, 


TuEorEM V“. If the given function f(z) can be developed in 
the neighborhood of the point 2) in a series in integral positive 
powers of «—a,=/h so that 

S (®) =f (y+ & — 29) = Oph” + Cn h™t1+ «2, (Cm = 0) 
then f(x) is not an extreme of f(x) or is an extreme of f(z) 


according as m is odd or even; and f(x ) is a maximum or mini- 
mum according as ¢,, is negative or positive. 


For here f"(%9) = 0 = f'"(a) = +++ =f" (a), 
while FO @)y= eee 
This theorem may be proved directly by means of the property 
of series. For under the given assumptions corresponding to every 
quantity «€ >0, we may choose another quantity 8>0, such that 
— € <n hh Gn gl + ee 
If m is a positive integer and c¢,, <0, say, and if |A|<6, 
then SF (y+ h) — f (29) <2” (Cm + €) ; 
and as ¢ may be taken such that e<—c,,, the expression on the 


right-hand side is always negative, so that there is a maximum of 
f(#) at =a. Similarly, we may prove the remaining part of V*%. 
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B. FUNCTIONS WHICH HAVE ONLY ONE-SIDED DIFFERENTIAL 
QUOTIENTS OF A CERTAIN ORDER FOR A VALUE «= 2,* 


7. THEoreM VI. If the continuous function f(z) has for # = Ko 
one-sided differential quotients of the first order and of opposite 
sign (including + and —), then /(z,) is a proper extreme, 
being a maximum or minimum according as the right-hand 
differential quotient of f(x) is negative or positive. 

For if, say, the left-hand differential quotient is positive, the 
right-hand one being negative, then there exists a positive quantity 
6 such that according as —d6<h<0 or 0<h<6, we have 


J (% th) — fF (Zp) SyiW Gee 
h 


It follows that /(z,+ h)— f(z) <9 for all values of h that are 
situated within the interval —6.-. +6. Hence /(2,) is a proper 
maximum. 

THEOREM VII. If for =~, the 2h first differential quotients 
of f(z), viz., f(z), f(z), +++, f2"(az), vanish, and if f@(x) has for 
z=, one-sided differential quotients of contrary sign (+o and 
—o included), then the value /(x)) forms a proper extreme of 
(2), being a maximum or a minimum according as the right- 
hand differential quotient is negative or positive. 

If, for example, the left-hand differential quotient is positive, 
while the right-hand is negative, that is, 

POPC —W= JOP) 9 ang LM WIM) <o 
we note, since f@"(zx,) = 0, that f/@”(x)+ h) <0 for all values of h 
within the interval — 6--- + 6 (the value 4 = 0 excepted). Hence 


from formula [5], viz., vs 
pee : 
J (@ot h) —f(%) = anit ort Oh), 
it is seen at once that f(x) is a proper maximum. 
THEOREM VIII. If for z=a, the 2k—1 first differential 
quotients vanish, viz. f'(%)=f"(%)=-++-=/f?*-» =0, and if 


* Stolz, p. 206; see also Pascal, Lxercici, etc., pp. 215-222. 1895. 
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f@k-Y(a) has for «=a, one-sided differential quotients of oppo- 
site sign (+0 and — included), then /(%») does not form an 
extreme of f(a). If, however, these differential quotients are both 
positive or both negative, then /(%) is a proper minimum or 
maximum of f(z). This theorem follows from [4] in the same 
manner as the preceding one did from [5]. 
Example. If f(x) = 2# («xZ0) and f(x) = (— 2)#(x<0), show by means 
of Theorem VI that there is a proper minimum at z=0 if p lies be- 
tween 0 and+1. Verify the same result when yp lies between 2k 
and 2k +1 by making use of Theorem VII; and by using Theorem IV 


show that f(x) is a proper minimum when yp is situated between 24—1 
and 2k. 


C.. UppER AND LoweER LIMITS OF A ONE-VALUED FUNCTION 
WHICH IS CONTINUOUS FOR VALUES OF THE ARGUMENT WITHIN 
A DEFINITE INTERVAL 


8. If the function f(z) is continuous and uniquely defined in 
the definite interval (a,b), there exist the greatest and the least 
value* in the interval in question, which are known as the wpper 
and lower limits of the function in this interval; and, further, the 
function reaches these limits; that is, if these limits are denoted 
by g and &, then there is at least one value ¢ of 2 within the 
interval a---6 for which the function is equal to g, and at least 
one. value d within the same interval for which the same function 
is equal to k. ; 

But if the interval within which « varies is indefinitely large, 
(a, ©) or (— ©, b) or (— ©, + ©), the function need not have a 
maximum or a minimum, and also it need not have an upper or a 


lower limit. This is illustrated in the following examples.t (See 
also § 96.) 


* Proofs of this and the following statements are found, for example, in Harkness 
and Morley, Intr. to Analytic Functions, §§ 46, 50; E. B. Wilson, Advanced Cal- 
culus, §§ 19-25. See Peano, Theorem IV, § 21, and also Dini, Fundamenti per la 
teorica delle funzioni di variadili reali (German translation by Liiroth and Schepp, 
§§ 36, 47). These proofs are founded upon Weierstrass’s lectures, which, in turn, 


are founded upon the work of Bolzano, Abh. d. Béhmischen Gesellsch. der Wiss., 
Vol. V, p. 17. 


t Peano, § 132. 
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Example 1. Divide a number into two parts so that their product is a 
maximum. (Cf. Ex. 6 at end of § 10.) 


Let a be the given number, x and a—~z the two summands, and 
y = (a—x) x their product. If we consider z as variable, we have y’=a—2z2, 


which becomes zero for n= 5. We further have 7” =— 2, so that the 


function y has a maximum for z = = that is, when both parts are ecual 


this value being y = a 


Since, however, the derivative 7’ is positive for x < © and negative for 


oe <, it follows that the function increases in the interval (- 0, s) and 


= 


decreases in the interval (5. = ): The function has neither an upper nor 
a lower limit. 


Example 2. y =z”. (z > 0) 


Through differentiation we have y =2*(1+logz). The first factor 
is never zero and is always positive. The second factor becomes zero 


when logz =—1 or x= - The derivative passes therefore from negative 
e 


(for < *) to positive values (tor L *). The function has a minimum 
e€ e 


for — == = 0.36788 ---, which is y=0.676411---. This is also the lower 


limit eink the function takes in the interval (0, ©). The function does 
not have either a maximum or an upper limit. 


» f= Ret. 

The derivative is zero for no finite value of z, but is infinite for 7 = 0. 
For this value y becomes zero, and the function will have at this point both 
a minimum and a lower limit with respect to the interval (— ©, + ©); for 
all the values of 2 cause the function to be greater than zero. The function 
has neither a maximum nor an upper limit. 


alte 


Example 3. y= 23 


9. If we add to the postulates already made in the previous 
article regarding f(z) that it must have a complete differential 
quotient for all values of # between a and 0, then /"(x) vanishes 
for every value of x between a and } to which one of the values g 
or k of the function belongs. If, however, f(a) = g, say, then possibly 
f(a) is only a one-sided maximum of f(z), and consequently /"(a) 
is not necessarily zero. This must be borne in mind as we proceed 
with the problem of determining the numbers g and k. This is 
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explained by the simple example (see Liowville’s Journal, First 
Series, Vol. VII, p. 163): 

In the plane of a circle which is described about the point O as 
center with radius 7, let there be given an arbitrary point 4 which 
is different from O. Determine the upper and lower limits of the 
distances of the point 4 from a point M of the circumference. 

Let the positive X-axis be taken passing through O, and standing 
perpendicular to it through O is erected the Y-axis. The equation 
of the circle is then «2+ y2= 72; while 


AM” = (a — #)?4+ y2= 172+ a — 2 az. (a) 
As M passes over all points of the circumference, « takes the values 
in the interval —7r-.---+7. The linear function (a) decreases with 


increasing values of «, its 
differential quotient being a 
negative constant equal to 
—2a. Consequently those 
values of x to which the 
upper and lower limits of 
AM” belong, fall on the 
end-points of the interval 
—r.--+~r, It is seen that 
—r corresponds to the upper limit and +7 to the lower limit, giving 
us as upper and lower limits respectively |a +7| and |a —7|. 

10. Suppose next that the function f(z) is discontinuous at least 
on an end-point of the arbitrary interval (a, 6); for example, sup- 
pose that the function is not defined at such a point. If this is 
the case only for the limw=a, then’in the derivation of the 
upper and the lower limit we must consider in particular the value 
of f(«) for the lim «=a+0. The following examples will make 
clear the method of procedure (see Stolz, p. 210). 


ie 


Fie. 6 


Example i, Consider the function y = ae for values of x such that 
: , - 02 x& 
0<2« <1. It is seen that y is negative and decreases with increasing values 
of «. For w F 
x. For when limx=+0, then limy=—0; 
and when limz=1-—0, then limy =—o. 


Thus the upper limit of y in this interval is zero, While the lower limit is 
— %, although neither is reached. 
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Example 2. 7 =(1— 2) sin uf (0<2=1) 
x 


For these values of x we have always |y|<1. 
If we consider only values of x such as x = ——~— 
integer), we have 4n+1 


y=(1- p= 5)sin En +1)=1- le 
dn4+1 2 4n+1 


Hence when n == + ©, the upper limit of y is +1. 
) 


(where n is an 


By writing 2 = z = 1’ it is seen that the lower limit is —1. Neither 


the upper nor the lower limit of y is reached, although in either case 
they are finite. 


PROBLEMS 


1. Determine the maxima and minima and the upper and lower limits of 


log « i 
(@) y= (> 0) ott 
OR ; 
(b) y=acosz + bsinz. ee +1 


(©) y=at a, (Pierpont, p. 320.) 


1 
: (g) y= 2? — et? 
(d@) y=1—2%. (Maclaurin, Vol. II, 


1 


p- 720.) (h) y= ez, 
“pal : 1 
Nerina o (CNG has -s 
(©) @ ein — (The function has a Gye Hine nC Thateie sore 
discontinuous derivative for treme on the position 
r= 0.) a 0.) \ 
Fes CLF 
2. Show that the function ie Se Na Cay) 
F0)=9 


has an infinite number of maxima and minima within the interval 


(—1---+1). 


3. When is mp + nq a minimum, where p= Vert y, d= Vie + Gar’ 
(Leibniz, 1682.) 


4. “Invenire cylindrum maximi ambitus in data sphaera.” (Fermat, 
CEuvres, Vol. I, p. 167. 1642.) 


5. Find the area of the greatest parabola which may be cut from a 
given cone. 
5 (Euclid, Book VI, 
Prop. 27.) Cantor (Geschichte der Math., Vol. I, p. 228) says that this is the 
first.example of a maximum in the history of mathematics. 


6. «(x—a) has its greatest value when x = 
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7. On a given line AB are two fixed points P, and P,. Determine a 
APB 
PPO PE, 
and Fermat, Quvres, Vol. I, p. 140.) 

8. Of all sections which pass through the vertex of a cone, determine 
the one of greatest area. (Severus.) 

9. The number a is to be divided into two parts, such that their product 
multiplied into their difference shall be a maximum. (Tartaglia, General 
Trattato, Part 5, fol. 88.) 

10. A ten-foot pole hangs vertically so that its lower end is four feet 
from the floor. Find the point on the floor from which the pole subtends 
the greatest angle. (Regiomontanus. 1471.) 


third point so that isa minimum. (Pappus, Book VII, Prop. 61, 


11. The curve y = 7? — 
rechnung, Vol. III, p. 744.) 


12. Two points P, and P, not on the straight line AB are given. Find 
a point P on AB such that PP? + PP; isa minimum. (Solved by Huygens 
possibly about 1673. See Huygens, Opera Varia, pp. 490 et seq. Note the 
letters of De Sluse.) 


13. Derive the greatest rectangle that can be described in, and having 
one of its sides, upon the base of a given triangle. (Simpson, Elements of 
Plane Geometry (1747), pp.106 et seq. In this work are also found numerous 
problems that have to do with areas, volumes, etc.) 


; has no minimum. (Euler, Differential- 
log x : 


CHAPTER II 
FUNCTIONS OF SEVERAL ; VARIABLES 
I. ORDINARY MAXIMA AND MINIMA 
PRELIMINARY REMARKS 


11. We say that the function w=/(«,, %,+++, Z,) becomes a 
maximum or minimum on the position (a, d,+-+-, @,) if for a 


sufficiently small region about (a,, a,-++, @,) we have 
WACZE Boasts Gn) = f(t, a) Ln) 
or J (a, Ba,***; Gn) S f(t, a) Ln). 


These extremes are proper or improper according as the sign = 
does not, or does enter. 

As in §1, it is assumed here that the function has definite 
partial derivatives which are continuous within the region in ques- 
tion with regard to each of the variables; and the extremes which 
may be derived we shall call ordinary. If the partial derivatives 
do not have such derivatives, the extremes may be called extraor- 
dinary. Such extremes in their generality we shall not attempt to 
consider. Another class of extraordinary extremes is mentioned 
in § 13, and is later treated in its generality for the case of func- 
tions of two variables ($§ 20 et seq.). 

12. Consider the function of one variable x, viz, f(a, 4, 

++, Q»). If the function w of the preceding article is a maximum 
or minimum for 7,= @,,+°++, %,= @,, then f(a, @1, +++, 4) will be 
a maximum or minimum for #,= a,. Hence (see § 2) the derivative 
Se (%; Qy,+++, 4) must be zero. Similar conclusions may be made 

for the other variables in w. 
It follows that if u =f (x1,-+++,%p,) has an extreme on the position 


(a4, Mg, +++, Ay), the first partial derivatives of wu must be zero. 
ile 
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(See Euler, Cale. diff: (1755), p. 645; and Lagrange, Théorve des 
Fonctions, Vol. II, No. 51.) 
Write* next v= a,+ Ayl, +++, t= Ot h,t, and put 
F(t)=f (44+ Myt, +++) Gynt Anl). 


If w= f(a, +++, %) 18 an extreme on the position (4,,-++-, %), 
then F(t) is an extreme on the position ¢ = 0. 

Since by hypothesis the derivatives of w are continuous, it 
follows also that the same is true of F(t). 

We consequently may write 


F'(ty=fz, (24, By, ++, Ly) hy thr, (yp Zq,+**,lm)hgt +> 
os. (24, pO ee a, Ln) hy. 
It follows from § 2 that 
F'(0)= fa, (44, > yehse An) hy + car. > tee (41, ae) An) hy= 0, 
whatever be the values of h,, h,,---,h 
We therefore have 


Sa, (A, 22, Oy) = 0,-- Sa, (4p =< ty Sn) 0, 
as was just seen. 
We further have 
H(t) = foie, (®yy Way +++) Gy)hE + fins, (yy Lys + + +4 Lyyhgt +++ 
2g 2 fz,0,(%, nse An) Rghig + ae a 

If w is to be an extreme for the position under consideration, then 
F(t) must be an extreme for ¢ = 0, so that for a maximum we must 
have (§ 3) F'"(0) = 0, and for a minimum F”(0) = 0, whatever be 
the values of h,,h,,+++,h,. If for the time being we omit the sign 
= from the two expressions just written, we have the theorem: 


nm e 


In order that the function w be an extreme at the position 
(44, +++, Gn) for which the first derivatives vanish, it is necessary 


that the following homogeneous function of the second degree in 
hy, +++, hy, viz, 


i) = Sava, (A Ua, +++, Uy) hi + fala, (Oy, Bog,e*s, An) hot aes te 
a 25 jee, (4s yyie nis An) hihg+ tee, 


* See also Peano, § 134. 
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assume only positive or only negative values, whatever be the 
values of hy, +++, hy, except when these quantities are all simul- 
taneously zero. 


13. We distinguish three kinds of integral functions of the 
second degree, or as they are usually called, quadratic forms,* viz., 

I. Definite forms, which with real values of the variables have 
always the same sign, that is, only positive values or only negative 
values, and are only zero when the variables are all zero. 

Il. Semi-definite forms,t which always have the same sign, 
but which vanish also for other values of the variables that are 
not all zero. 

III. Indefinite forms, which with real values of the variables 
can become both positive and negative, and that too for values of 
the variables whose absolute values do not exceed an arbitrary 
small quantity. 

The theorem of the preceding section may be written as follows: 

If for #,=4,,--+, %=4, the first partial derivatives of the 
function w=/(#,,+++, #,) vanish, and if in the Taylor develop- 
mentt for f(z, +h,, +++, Z,+h,) the term which is a homogeneous 
function of the second degree in /,,---, i, is an indefinite form, 
then w on the position (a,,---, @,) has neither a maximum nor a 
minimum value. If, on the other hand, that term is a posztive defi- 
nite form, then w is a minimum, and if it is a negative definite 
form, vu is a maximum. 

The case where the form is semi-definite is included under the 
extraordinary extremes, and we shall consider it later (§§ 20 et seq.). 

14. Next is given a criterion to determine whether a given quad- 
ratic form $(h,,-+++, Am) is a positive definite quadratic form. 

If @ depends only upon one variable h,, we shall have 6=<Ah a 
and this is positive when and only when 4 is positive. 

If @ depends upon two variables h, and h,, we shall have 


hb = Ah? + 2 Bhihy + Chg. 


* See Gauss, Disq. Arithm., p. 271. 

+ So called, for example, by Scheeffer, Math. Ann., Vol. XXXV, p. 555. Gergonne, 
Gerg. Ann., Vol. XX (1830), p. 331, called attention in particular to this case. 

{ This development is found in full in § 50. 
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If here ¢ is a positive definite form, it follows that for h,= 0, 
h,#0, then ¢=Ahj, and consequently 4 must be positive. We 
may also write ¢ in the form 


p= ‘ [ (Ah, + Bhg)? + (AC — B?) bel. 


If in this expression we give to h, and h, such values that 


Ah, + Bh, = 0, it is seen that ¢ takes the form ¢ = “(Ae — By h2. 
We must therefore have AC — B?> 0. 

The conditions A >0 and 4C— B?>0 are not only necessary, 
but they are also sufficient that ¢ be a definite quadratic form. 
In fact, if h, #0, we have (AC — Bh? >0 and (4h, + Bh,)?= 0, 
and consequently the sum of these two expressions, and also ¢, is 
positive. 

If, in general, @ depends upon several variables h,, he, hs, ++ >, 


we may write 
fh = Ah? + 2 Bh, + C, 


where A is a constant, B a form of the first degree in hg, hs, +-- 
and Ca quadratic form in hy, hg, ++ + 

If hy, hs,+++ are all zero, but h,# 0, we will have B and C zero 
and ¢= Ah? We must therefore have 4>0, if ¢ is to be a 
positive definite form. 

The form may be written 


> 


$ =< [(Ah, + B+ (AC — BY) 


where AC — B? is a quadratic form of h,, h3,---. The quantity 
h, may be determined so that 4h,+ B= 0 with the result that 


L 


Hence the expression 4C—B? must be positive and different 
from zero. 

Next write AC — B= (he, hg, ++ -), where ¢, is a quadratic 
form in hy, hs,--+ which is always positive and different from 
zero except when all the variables vanish. 
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It follows that the necessary conditions that ¢ be a definite 
positive form are (1) that A be greater than 0 and (2) that 
AC— B* be a positive definite form in the variables hy, gy ++. 

These conditions are also sufficient; for if we give to h, an 
arbitrary value and to hy, h,,--+ arbitrary values which are not 
all zero, then of the two summands into which ¢ is distributed, 
the first is positive or zero, while the second is positive. It follows 
that ¢ is positive. On the other hand, if we give to hp, hg, ++ 
simultaneously the value zero, then h, must be different from 
zero, and from @= Ah? it is seen that A must be positive. In 
this way the determination of the question whether a quadratic 
form is definite and positive is reduced to the determination of 
the same question in the case of another quadratic form of fewer 
variables. If then the process is continued, we come to the forms 
in one or two variables already considered. This subject is further 
considered in §§ 53 et seq. 

To determine whether a quadratic form ¢ is definite and nega- 
tive, we have to determine whether — ¢ is definite and positive. 
(See Peano, § 137.) 


Il. RELATIVE MAXIMA AND MINIMA 


15. To introduce the theory, we shall consider here a simple 
case involving only three variables. Let it be required to deter- 
mine the extremes of the function 


u= F(x, y, 2), 


where the variables xz, y, 2 are restricted. Suppose, for example, 
that they are connected by the equation 


SF (#; Y, 2) = 0. 


If from the latter equation z is expressed as a function of « 
and y, and if this value is substituted in the first equation, we 
shall have w expressed as a function of wand y. The values 2, y 
which make w a maximum or minimum cause the total derivative 
dw to vanish for all values dx and dy. 
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ee = dis ae. 
We have =— = a Si 
where dz denotes the Rac? of z, which is defined through 


ii 
the equation OS te Sy en 
on oy Oz 


If this last equation is multiplied by the indeterminate quan- 
tity — 2 and added to the equation du = 0, we have 

/ ‘ee 

(4-22) ax v+(F ae ay +(F nD) az = 0. 
Ox Ox oy Oz 

If in this equation we choose A» so that the coefficient of dz 
vanishes, then corresponding to the maxima and minima values 
of w the coefficients of dz and dy must also be zero, and we thus 
have the equations 


OF Ty? Pe eee 
0x 0x oy oy oz oz 


It is evident that we have these expressions which are sym- 
metric with regard to the three variables if we form the three 
partial derivatives of “—2/, where 2 is an indeterminate quan- 
tity, and then put the resulting expressions equal to zero. 

These three equations, together with the two equations f= 0 
and u =F, determine the unknown quantities A, 7, y, z, u, which 
correspond to the values of w for which there exist maxima and 
minima values. 

We may proceed in the same manner with an arbitrary number 


of variables and equations of condition. (See Lagrange, Théorie des 
Fonctions, p. 268.) 


PROBLEMS 
1. Find the minimum value of u, where 
U= 27+ y® + Zr 4 0, 
and where 2, y, 2,--- are connected by the equation 
ax + by +cz+---=k, 
2. ir, +2. +--+ +a, =a, show that 
r+ okt... + kh 


is a minimum when 2, = 4, =-++ =a). (Maclaurin.) 
: . 


CHAPTER III 
FUNCTIONS OF TWO VARIABLES 
I. ORDINARY EXTREMES 


16. Let z=/(x, y) be a continuous function of the two vari- 
ables z and y when the point P with codrdinates (x, y) remains 
within the interior of an area © which is limited by a contour C. 
We say that this function /(z, y) is a minimum for a point (x, Yo) 
of the area 2 when we can find a positive quantity 6 such that 


h ; 
ala A=f (Zt h, Yo+h)—f (Los Yo) = 9 (2) 


for all systems of values of the increments # and & that are 
less than 6 in absolute value. The maximum is defined in a 
similar manner.* 

If we exclude the sign = in the expressions A2=0 or A=0, 
the extremes are said to be proper (cf. § 1); but if the equality 
A=0 exists for certain values of 4 and k that are less than 
6 in absolute value, however small 6 be taken, we have improper 
extremes. For example, in the case of the surface represented 
by the equation z=/(#, y), the axis Oz being vertical, a proper 
maximum corresponds to an isolated summit, but if these sum- 
mits form a line on the surface, this line will be a line of 
improper maxima. Consider, for example, the lines generated by. 
revolving the extremes of a plane curve about the Ov-axis. 

If in the expression (i) we regard y as constant and equal 
to Y, then z becomes a function of one variable « and (§ 2) 


the difference SF (Goth, Yo) —F (o> Yo) 


can only retain a constant sign for small values of f if the 


of 


derivative oe is zero for =X), Y= Yo. 


* See also Goursat, loc. cit. 
23 
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In the same way it may be shown that these values must 


also cause a to be zero. It follows that the systems of values 


which cause f(x, y) to become proper extremes are to be found 
among the solutions of the two simultaneous equations 
fied 0, af = 0); (@) 
Ox oy 
conditions which are also necessary for improper extremes. 

As only ordinary extremes are considered here, the partial deriva- 
tives of the second order of f(x, y) are supposed to be continuous 
(§ 11) in the neighborhood of the values x), y) and are not all zero 
for 2, Yo, and, furthermore, the derivatives of the third order are 
supposed to exist. If, then, z=, and y=y, are a solution of the 
two equations (a), the formula for Taylor’s theorem gives us 


A=f (ath, Yo+ k)—F(&o. Yo) 
a2 py) ae 
=f (whan +254) 


ee 02 0Yy ey? 
1 é \8 i 
=|(4—+hk—)f(a,y) |. 
as miltas - ie a, *) 
Y=Yot Ok 


For values of 4 and k in the neighborhood of zero, it is clear that 
the trinomial 


r ‘ 
mol + 9 nk is + eet 

a4 OL )0Y oy, 
gives its sign to the right-hand side of (i), and it is evident 
that the discussion of the sign of this trinomial is going to 
enjoy a preponderant réle. 

To have an extreme for =2), y=Yp, it is necessary and 
sufficient that the difference A retain a constant sign when the 
point (% +h, y+) remains within the interior of a square 
sufficiently small which has the point (%, Yo) for center. In 
this case the difference A will also retain a constant sign if 
the point (+h, y+) remains within a circle with radius 
sufficiently small and center (z,, Yo), and inversely ; for we may 
replace the square by the inscribed circle and reciprocally. 
Suppose, then, that © is a circle of radius 7 with the point 
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(%, Yo) as center. We have all the interior points of this circle 
by writing h=pcos¢,k=psing and causing ¢ to vary from 
0 to 2m and by causing p to vary from —r to +7. 

Making this substitution in A, it becomes 


2 3 
A=F (4 cos’ + 2 B sin ¢ cos g + C sin?) i 


92 2 2 
= Baa C= ae and where ZL is an expres- 
sion which retains a finite value in the neighborhood of the 
point (%, Yo). 

It is evident that several cases are to be distinguished according 
to the sign of B?— AC. 

17. First case. B*—AC>0. 

The equation A cos*?+ 2 Bsin¢ cos¢+ Csin?d=0 admits 
two real roots in tan q, and the left-hand side may be written as 


the difference of two squares in the form 


where A = 


2 3 
= = [a(a cos @ +b sin $)?— B(a' cos d + b/sin f)?]+ ae, 
where a>0, B>0, and (ab'—ba')#0. 


By taking the circle sufficiently small we may neglect the 
terms of the third and higher degrees in p. If next to the angle ¢ 
a value is given such that acos$+6 sing = 0, it is seen that A 
will be negative; while if we give the angle ¢ a value such that 
a'cos ¢ + b'sin ¢ = 0, then A will be positive. 

It is therefore impossible to find a number 7 such that the dif- 
ference A retains a constant sign when the absolute value of p is 
inferior to 7, while the angle ¢ is arbitrary. It follows that the 
function f(z, y) has no extreme for v= 2%, Y= Yo. 

18. Second case. B?—AC<0. 

It is evident that 4A and C must have the same sign. 

The trinomial 


A cos?h + 2 B cos ¢ sin d + C'sin?d 
= 5 [(A cos + B sin $)?+ (AC — B*)sin?$] 


does not vanish when ¢ varies from 0 to 27. 
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Let m be the lower limit of the absolute value of the trinomial 
and let H be the upper limit of the absolute value of the function 
L in a circle of radius & and center (Zp, Yo)- 3 

Let 7 be a positive number inferior to & and to a Within 


the circle of radius 7 the difference A will have the same sign as 
the coefficient of p%, that is to say, of A or C. The function /(, y) 
has therefore an extreme for 7= %, Y= Yp. 
19. The above results may be summarized as follows: If at 
the point 2, Y¥ we have 
ee 


Ox )0Yo]/ Omg CYyg 


there is no extreme; but if 
Ref \2_ otf B2 
(,20.) fo 
OL )0Y 9 0x5 0Y> 
there is a maximum or minimum according to the sign of the two 
TR 
. oa ey9 “co . . . 
There is a maximum if these derivatives are negative, a mint- 
mum if they are positive, and it is also seen that we have a 
proper maximum or minimum.* 


> 


Example. In the theory of least squares it is required to determine 2, y 


so that the expression roe 


(A) u(2, y)= > (ax + bpy + cr)? 
k=1 


may be as small as possible. In other words, determine the values of x and 
y tor which u(a, y) is equal to its lower limit. 
Following the methods indicated above we must solve the two equations 


ee c= 
Seta, Siow + bey + cp) an = 0, 
(B) - 
k=n 
1 éu 
D dy = > (ane + bey + cx) by = 0. 
eS k=1 


It is seen that the determinant of these equations is equal to the sum of the 
5n(n—1) squares (apb; — aby)? (k,l = 1, 2S eee 1), and this deter- 
minant does not vanish if among the binomials ayr + by there are at least 


* See Lagrange, Misc. Tawr., Vol. 1. 1759. 
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two whith do not differ from each other by a constant factor. Under this 
assumption the two equations (B) have one and only one system of solu- 
tions 2, ¥o- 

That w does in fact reach its lower limit for this pair of values is seen 
if we write in (A) z=2, + 6, y=y) + 7, and expand. We then have 


c=n 
U (2 +f, Yo + 9) — UZ» Yo) => (ae + dyn)’, 

k=1 
which difference is a positive quantity for every system of values except 
£=0, y= 0. 

PROBLEMS 
1. Find a point P of a plane such that the sum PA + PB+ PC of its 

distances to three fixed points of the plane is a minimum. In particular 
consider the case when BAC >120°, and show that here the point A gives 
the minimum. (Cavalieri, Exercitationes Geometricae, pp. 504-510. 1647.) 


2. In a plane triangle all of the angles have been measured with the 
same precision and found to have values a, B, y- On account of the 
unavoidable error in observation, the sum a+ 8+ y does not equal 180°. 
Let the difference 180 —(a + B + y) be equal to 6, where 6 is expressed in 
circular measure. What values u, v, w (in circular measure) must be 
added to the three results of measurement if we wish 


(1) thata+ B+y+utv+w=180, and 
(2) that u? + v? + w? be as small as possible ? 


Answer. w=48=v=w. 


INTRODUCTION TO THE AmBIGuoUS CASE Bb?— AC =0 


20. We shall first note the difficulties that attend this special 
case, and with Goursat* we shall illustrate these difficulties by 
means of geometric considerations ; we shall then call attention to 
erroneous deductions which have been, made, and later a method 
will be given, due to Scheeffer, of determining the extremes for 
this case, when they exist. 

Let S be the surface represented by the equation 2=/(a, y). 
If the function /(z, y) has an extreme at the point x, yp, and if 
the function and its derivatives are continuous, we must have 

eae an 
0X, OY, 


* Goursat, p. 112. 


b 
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which shows that the tangential plane to the surface S’at the 
point P, (with codrdinates 7%, Yo, %) must be parallel to the 
ay-plane. In order that this point shall correspond to an ex- 
treme, it is necessary that in the neighborhood of the point Po 
the surface S be entirely on one side of the tangential plane. We 
are thus led to the study of a surface with regard to a tangential 
plane in the neighborhood of the point of contact. 

Suppose that the origin has been transposed to the point of 
contact. The tangential plane being taken as the zy-plane, the 
equation of the surface is of the form 


z= aa+ 2 bay + cy*+ aa+ 3 Bary + 3 yay?+ by’, (2) 


where a, b, ¢ are constants and a, 8, y, 6 are functions of z, y 
which remain finite when « and y tend towards zero. To deter- 
mine whether the surface S is situated entirely on one side of 
the «xy-plane in the neighborhood of the origin, it is sufficient 
to study the intersection of this surface by the zy-plane. This 
intersection is a curve C' represented by the equation 


J (a, Y) = a+ 2 bry + cy?+ aav+.--=0, (2) 


and presents a double point at the origin. 
21. If ?— ac is positive, the equation 


at 2 bay + c= : [(ax + by)?— (P— ae) y] = 6 


represents two real and distinct straight lines which pass through 
the origin. Suppose that we take these two lines for the axes 
of coordinates, and note that this is brought about by a linear 
change of the variables. 

The equation (iz) then has the form 


cy + h(x, y)= 0. (v2) 
If in this equation we write y= ua, we have 
R(x, wx : 
“u=— at) ? (w) 


where it is evident that R(x, ux) is divisible by 2°. 
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It follows from § 140 (see also Goursat, § 34) that equation 
(wv) has one and only one root, say u = ¢ (x), which tends towards 
zero with x Hence through the origin there passes one branch 
of the curve C represented by an equation y= «#{(z), which is 
tangent at the origin to the axis Ox. If we interchange the 
role of the two variables x and y, it is seen that there also 
passes through the origin a second branch of the curve C which 
is tangent to the axis Oy. The point O is a double-point with 
distinct tangents. 

If, then, 2— ace >0, the intersection of the surface S by the 
tangential plane presents two distinct branches of curve C, and 
C, which pass through the origin, and the tangents to these two 
branches of curve at the origin are ie 
represented through the equation 


au+ 2 bayt+ cy?= 0. 


If we give to each region of the 
plane in the neighborhood of the 
origin the sign of the first term in 
(vit), a8 seen in the figure, it is clear 
that if a point moves along either 
of the curves C, or C,, the left-hand 
side of (az), and consequently also 
of (iz), changes sign as the point passes through the origin. It fol- 
lows that f(x, y) does not have an extreme (cf. § 17) at the origin. 

22. If }—ac<0, the origin is a double isolated point; for 
within the interior of a circle with sufficiently small radius 
described about the origin as center, the right-hand side of (7) 
only vanishes at the origin itself. To show this write x= p cos 4, 
y =r sing, where « and y are the coordinates of a point in the 
neighborhood of the origin. 

Equation (iz) becomes 

ST (%, y= p?(a cos*f + 26 sin $ cos d + ¢ sin?d + pL) 
where LZ is a function of p and ¢ which remains finite when p 
tends towards zero. Let H be the upper limit of |Z| when p is less 
than a positive number 7. 


Fig. 7 
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When ¢ varies from 0 to 27, the trinomial 
a cos?h + 26 sin ¢ cos d + ¢ sin? 
retains a constant sign. Let m be the minimum of its absolute 


value. It is clear that the coefficient of p? does not vanish for 
any point on the interior of a circle C with radius less than r and 


™, having the origin as center. Consequently the equation 
vad 


f(«, y)= 0 admits of no other solution than = 0, y= 0(e., p = 0) 
within the circle. 

It follows that f(x, y) retains a constant sign when the point 2, y 
moves within the interior of this circle. Hence, also, all the points 
excepting the origin of the surface S which may be projected upon 
the circle C are situated upen the same side of the zy-plane. The 
function f(z, y), therefore, presents an extreme at the origin (cf. § 18). 

23. When 0?—ac=0, the two tangents at the double point 
coincide, and there are, in general, two branches of curve tangent 
to the same straight line, which form a cusp. 

The complete study of this theory will be found to require a 
somewhat delicate discussion. 

For example, y*= 2 presents at the origin a cusp of the jirst 
kind ; that is, one which has the two branches of curve tangent to 
the Ox-axis lying the one above and the other below this tangent. 

The curve #?— 2 ay + «+— a= 0 presents a cusp of the second 
kind; the two branches of curve are tangent to the z-axis and 
situated on the same side of it. The equation gives us, in fact, 
y = 0+ a. The two values of y have the same sign in the neigh- 
borhood of the origin and are only real when z is positive. 

The curve a*+ 2y?— 6 ay + y2=0 presents two branches of 
curve which offer nothing peculiar, both being tangent at the origin 
to the «axis. We have from this equation 


_ 3 at+ g2V8 — 92 
1+ 22 


from which it is seen that the two branches obtained when we 


take successively the two signs before the radical have no singu- 
larity at the origin. 
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It may also happen that the curve is composed of two coincident 
branches, as is the case of the curve represented by the equation 


J(% Y= y— 22y +a4=0; that is, (y —#)?= 0. 


It is evident that here the left-hand side passes through zero 
without changing sign. 

It may also occur that the point x, y) is a double isolated point, 
as is presented through the curve 

y+ e+ y=0 
at the origin. 

From the above it is seen that if the origin is a double isolated 
point, or if the intersection of the surface with the tangent plane 
is composed of two coincident branches, the function f(a, y) will 
be an extreme (in the latter case just given an improper extreme); 
but if the intersection is composed of two distinct branches which 
pass through the origin, there will, in general, be no extreme, for 
the surface again cuts its tangential plane. 

24. Take, for example,* the surface 


z= (y — a) (y — 2 a”), 


which cuts its tangential plane along two parabolas of which the 
one is interior to the other. That the surface may not cross its 
tangential plane, it is necessary that if we cut this surface by any 
cylinder having its elements parallel to Oz and passing through Oz, 
the curve of intersection shall lie on one side of the xy-plane. 

Let y=¢(«) be the trace of such a cylinder upon the zy- 
plane, the function ¢(x) being zero for z= 0. If /(0, 0) is to be 
a minimum, the function f(z, ¢(x)) =F (x), say, ought to be a 
minimum for z= 0, whatever the function ¢(7). 

To simplify the calculation, suppose that we have chosen the 
axes of coordinates so that the equation of the surface is of the 


form z= Ay? + bs(a, y)++°°, 


where 4 is a positive quantity. 


* This is a generalization due to Goursat (p. 115) of the classic example of Peano 
(loc. cit., Nos. 133-136). 
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With this system of axes we have for the origin 


aE = 0, ee 0, 8 = 0, of 0, : ee >0. 
Oxy OY) Lo Oa CY) oy? 


The derivatives of F(a) are 
r= Z+29@) 


59 OF ays gry + Lore) ee 


fC Ol a oxoy 


For <= 0=~y these formulas become 
P'(0)=0, (0) = 2 [H'OVP. 
? oy? 
If ¢/(0) #0, the function #(z) evidently has a minimum for 
x=0; but if $'(0)= 0, it is seen that 
F'(0)=0, F'(0)=0, F!(0)= a 
and Fr) (0)= os + 8 Fa8y, Sa mieihs 3 eh gro 
#9 
Hence, in order that F(z) = a minimum, it is necessary that 
3 
oy be zero, while 


x3 
of 6 
mer an $''(0) + 3 ae" OP 


must be positive, whatever the value of r (0). 
These conditions are not satisfied for the surface 


z= y*— 3 ay + 2 at 
considered above, while they are satisfied for the function 
dim y+ aA, 

It is thus seen that in the ambiguous case, where B?— AC= 0, 
the derivation of the necessary and sufficient conditions for 
the extremes of functions of only two variables is going to be 
accompanied by difficulties. It is also evident that in the case 


of three or more variables these difficulties will be correspond- 
ingly augmented. 
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Il. INCORRECTNESS OF DEDUCTIONS MADE BY EARLIER 
AND MANY MODERN WRITERS 


25. One of the greatest mathematicians of all times, Lagrange 
(Théorie des Fonctions, p. 290), writes : 

If all the terms of the first and second dimensions [see formula (ii) of 
§ 16] vanish, it is necessary for the existence of a maximum or minimum 
that all the terms of the third dimension in h,, h,,--+ shall disappear and 
that the quantity composed of terms where h,, h,,--+ (cf. § 51) form four 
dimensions shall be always positive for the minimum and always negative 
for the maximum when 4h,, h,,--+ have any values whatever. 

Following Lagrange, all writers on this subject made the same 
incorrect deductions until Peano, in the remarks to Nos. 133-136 
found in the Appendix to his Calcolo, wrote: “The proofs for 
the criteria by which the maxima and minima of functions of 
several variables are to be recognized, and which are given in 
most books, depend upon the theorem that in the Taylor develop- 
ment for functions of several variables the ratio of the remainder 
after an arbitrary term to this term has a limit zero when the 
increments of the variables approach zero. This theorem is in 
general false when the term in question is not a definite form 
with respect to the increments of the variables, and when it is a 
definite form, the theorem needs proof.” 

These fallacious conclusions are found, for example, in Bertrand 
(Calcul Différentiel, p. 504), and also in Serret (Calc., p. 219), 
who writes: 

The maxima or minima exist if for the values h,,h,,--- which cause d?f 
and df to vanish the derivative df has invariably the minus or plus sign. 

Here d?f, d?f,.--- denote the homogeneous integral forms of 
the second, third,--- degrees in h,, A,,+++, when the function / 
is expanded by Taylor’s theorem (cf. § 51). 

Todhunter (pp. 227-229 of the 1864 and 1881 editions of 
his Calculus), for the semi-definite case where B?*— AC=0, writes 
the Taylor expansion for a function of two variables in the form 
(see (it) of § 16) x eT h 


2 
=o A+B) +( a+ By 


where fA, is the remainder term. 
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The condition which it appears that he considered as suffi- 
cient for an extreme is that A and &, must have the same 
sign, and if the terms of the second dimension are zero for 
the position or positions in question, then also the terms of 
the third dimension must be zero. 

That this is not true is seen at once by observing Peano’s 
classic example f(x, y)=(y — pa?) (y — 2), 
where the conditions just mentioned exist, although there is no 
extreme at the origin, as already seen in § 24. 

Professor Pierpont (Bull. of the Am. Math. Soc., Vol. IV, p. 536) 
says, “Our English and American authors seem to be ignorant 
of these facts.” 

Write Peano’s example in the form 

J (&, Y) = ay? + 2 baPy + cat. 

It is seen that the function f(x, y) is positive in the neighbor- 
hood of the origin upon every straight line through it; however, 
upon the parabola y= ma* the function in the neighborhood of 
the origin 1s positive, zero, 
or negative according as 
am?+2bm-+c is positive, 
zero, or negative. 

We may further illustrate 
this as follows: Let the 
equations 


P(x, y= y — $(2)= 9, 
¥(@, YN=y¥—¥@=0 
denote two curves through 

the origin. The function 


F(@%, Y= Be, WV (@, 9) gs 
will have positive values for values of z, y on the are BA of a circle 
with origin at the center and radius sufficiently small and negative 
values on the are AD. Hence the function f(x, y) has minimum 


values on all straight lines through the origin that cut the are BA and 
maximum values on the lines through the origin that cut the are 4D. 
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If, further, the two curves ®(a, y) = 0, V(2, y) =0 have a com- 
mon tangent at the origin with their curvatures lying in the same 
direction, it is seen that all possible straight lines through the 
origin are such that the codrdi- y 
nates of any points on them cause 
J (#, y) to have positive values. 
This is true, for example, of the 
function already considered, 

I ( Y) = (y — p?2) (y — 9?2?). 

In the spaces above and below 
both curves we have /(z, y)>0, 
while this function is negative 
for the spaces between the two 
curves; so that there is a minimum upon every straight line 
through the origin, although there is a maxiumm* of f(a, y) for 


Dead: + ¢ 2. 
Dp 


Fie. 9 


all points on the curve y= 


Il. DIFFERENT ATTEMPTS TO IMPROVE THE THEORY 


26. The existence of an extreme of the function /(a, y) at the 
origin, for example, a minimum, depends upon the condition that 
there exists an upper limit g such that the function /(2, y) for all 
values of #, y which satisfy the condition 

0<V2e2+ P<y 
is positive ; or, geometrically speaking (§ 16), this condition implies 
that there exists a circle with center (0, 0) within which the func- 
tion is everywhere positive with the exception of the position 
(0, 0) itself. 

Instead of considering the values of such a function for the 
coordinates of points on straight lines through the origin, which 
lines may be written in the form 


= Os y = Bk, 


_ * Note in this connection Scheeffer, Math. Ann.,Vol. XXVI, p. 197; and Vol. XXXV, 
p. 545. 
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a, 8 being arbitrary constants, it would be natural to raise the 
question whether we could not determine the sufficient conditions 
for such extremes by studying the more general curves expressed 
through the algebraic equations 


a (kh) = ak + ae + +++ + oe 


y (k) = Byk + Bok? + eas + B,k" 


and make the requirement that the function f(x(k), y(k)) shall 
have an extreme for k=0, whatever values there may be assigned 
to the positive integers m and n and to the m+ quantities a, 
Gy, ***, Om; By By, -++, Bn, it being of course assumed that all 
the quantities @ and 8 are not simultaneously zero. 

It may, however, be shown that such sufficient conditions cannot 
be derived in the manner indicated. For if we write 


@ (x, y) = y — sin?z, 


W(x, y) = y — sin? — e = 


we have two curves defined through the equations ®(z, y) = 0 
and W(x, y) = 0 which have at the origin the a-axis as a common 
tangent and a contact of an indefinitely high order. 

There is consequently no curve of the form (i) which may be 
laid between these two curves; for clearly any such curve must 
have with either of these curves a contact of indefinitely high 
order which is impossible for an algebraic curye. 

On the other hand, the function f(x, y)= O(a, y)V (a, y) is 
positive in the whole plane excepting that part of the plane that 
is situated between the two transcendental curves, in which it is 
negative. Hence at the origin there is neither a maximum nor a 
minimum for the function f(z, y), although for this function upon 
every curve (7) there enters a minimum. 

We may therefore desist from further requirements in this 
direction, and we shall next call attention to two methods, the 
one due to Scheeffer and the other to Von Dantscher, which are 
general in character when the discussion has to do with two 
variables and which lead to criteria which are of use in practice. 
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27. Scheeffer’s method. We have seen that functions of one 
variable which have ordinary extremes can be grees through 
the Taylor development in the form 


FT on (O01), (a) 


when f(x) and the derivatives f’(z),--+, f(x) are zero for 
z=0 while f(x) #0 for x=0. For such functions the change 
in value in the neighborhood of the position «=0 on either 
side is faster than that of a given quantity ax”; that is, positive 
quantities a, n, and 6 may be so chosen that be all values of x 
within the interval —6 to +6 the absolute value of /(zx) is 
greater than the absolute value of az”, excepting the value x= 0. 
For since f™(0)#0, we may so determine 6 that for values of 
# such that —d=x2=6 the function /™(zx) is different from 
zero. If, then, we choose the quantity a smaller than the absolute 


(2) 
value of I LO j in the interval — 6 to + 6, then (see formula (a)) 


within this eel the condition |/(a)| >|ax”| is satisfied. Recip- 
rocally, if the last condition exists, the m first derivatives of 
J (#) cannot all vanish for «=0. For in the latter case we 


Id h s 
wou ave mee +) (Oz) oe 
(n+1)! j 
and from this it follows that 
Sony 
Pa ", On 


which contradicts the assumption that | /(«)| >|ax|. 
1 


There are functions, however, for example e @ (cf. Pierpont, 
loc. cit., Vol. I, p. 205), for which such quantities n, a, 6, do not 
1 


exist. In fact, the absolute value of e ™ ig in the immediate 
neighborhood of «= 0 smaller than any arbitrary power aa”. 

We may note that the characteristic property of the above 
requirement consists in the fact that the behavior of the function 
in the neighborhood of the origin must be marked with a certain 
degree of distinctness. 
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The following consideration leads to the generalization of the 
above condition for functions of two variables: It is clear that a 
function f(#, y) which vanishes at the origin, if it is continuous, 
has upon the circumference of every circle which is described 
about the origin as center with an arbitrary radius 7 a greatest 
and a least value, provided the function does not reduce to a 
function of one variable r =Va?+ y2. The signs of these greatest 
and least values, which we denote by f,(r) and f,(r) respectively, 
offer for sufficiently small radii r a criterion regarding the appear- 
ance or nonappearance of an extreme at the origin.* For if the 
two quantities f,(r) and f,(7) are positive, there will be a mini- 
mum of f(#, y) at the origin, while if they are both negative, a 
maximum exists at the origin. The degree of distinctness which 
marks the behavior of the function at the origin is characterized 
through the existence of a power ar” with the property that for 
every value of 7 within a certain limit g both /,(r) and /,(r) 
are in absolute value greater than ar”. 

If this requirement is not satisfied we cannot count upon 
deriving sure characteristics of extremes through the expansion 
in series. For in this case the value with which the function 
J(#, y) in. the neighborhood of the position (0, 0) either ap- 
proaches the value zero from the one side, or having passed 
through zero differs from it on the other side, is so little that 
this value cannot be expressed through a power ever so high 
of vr, The development in series cannot, therefore, serve to de- 
termine whether the value is a little on the one side or on the 
other side of zero. 

As examples of this kind are the function 

ae 
yt es 
which has a minimum value at the origin, and the function 


Pp p 2 


2 
2 et - 22  2a2 
Pm 6 Psy e87 ee 
* The behavior of the function f(z, y) at any point %, Yo other than the origin 


may be made by the substitution «= 29 +h, y= yo+k, to depend upon the behavior 
of the function f(x +h, ¥o + &) = F'(h, k) for the values h = Ove =10; 
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which has neither a maximum nor minimum at the origin. The 
first function Ue eee the value zero from the positive direction 


up to the value e co (for y= 0) while the latter approaches the value 
zero from the negative direction by the same amount. 

To this class of functions belong also those functions whose 
initial terms constitute a semi-definite form and which contain as 
a factor an even power of a series P(x, y) the terms of which 
vanish for real pairs of values , y in every region arbitrarily 
small where 0<|az|<6, 0<|y|<6 (see §§$ 36 and 41). Belonging 
also to this category of functions are the functions which reach 
the value zero but do not pass through it for every region arbi- 
trarily small where 0 <|x|<6, 0<|y|<6. 

If on the other hand there exists a power ar” whose value, so 
long as we remain within a certain limit g, is always smaller than 
the absolute values of /,(r) and f,(7), then the question whether 
at the origin an extreme of the function exists may always 
be answered by a development in series and by a finite num- 
ber of observations. How this is accomplished is found in the 
next chapter. 

28. The method of Von Dantscher. We have seen that by 
considering the extremes on every line through (0, 0) we are not 
able to form any conclusions regarding the extremes of the func- 
tion f(x, y) at this point. Von Dantscher’s method consists in 
establishing criteria not only for the extremes on such lines but 
also for all points in the plane in the neighborhood of the points 
on these lines and also in the neighborhood and on both sides of 
the point (0,0). Although Von Dantscher himself finds that there 
is “no need of an extension or improvement of the Scheeffer 
method,” I shall give later the method of Von Dantscher, as it is 
of interest in itself and, besides, it is well to compare the two 
theories (see §§ 42, 44). 

29. The Stolzian theorems.* We shall at first assume that the 
function f(x, y) is continuous with respect to both variables in 
every point (x, y) of a rectangle that includes the point (0, 0), the 


* Stolz, p. 213. 
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sides of the rectangle being parallel to the codrdinate axes. We 
shall state and then prove the following theorems: 


TurorEM I. A necessary condition that f(0, O) be a proper 
extreme of f(x, y) is offered through the existence of an interval 
—§...46, within which x(#0) lies, and such that the upper 
limit of f(x, y), when « takes a 
constant value, the variable y being 
confined to the interval + @+++— «x, 
is had through the value y = $,(“), 
and the lower limit through 
y= $,(“). This necessary condition 


in question for a proper maximum 
is that f(x, y(@)) be invariably 
less than f(0O, 0), and for a proper minimum we must have 
invariably f(a, b,(x)) greater than f(0, O). 


Fie. 10 


In the first case the upper and lower limits of f(z, y) are both 
less than (0, 0) and in the second case they are both greater. 

Note that lim ¢,(#)= 0, since |,(x)|=|z|. The same is true 
of ¢, (2). si 

The same conditions must be true with regard to the wpper and 
lower limits of f(a, y) with constant y such that |y|<6, the vari- 
able x being limated to the interval —y---+y, which limits are 
reached through the values o(y) and wp, (y) respectively. 


THEOREM IT. The fulfillment of all the conditions made above is 
sufficient that f(0,0) be a proper extreme of f(a, y). Accordingly 
(0,0) is a proper maximum if there exists a positive quantity § 
such that we have simultaneously 

[1] for 0<|x%|<8, f(x, b4(x))<f(0, 0), 
and 

[2] for V<|y¥|<8, Fay), ¥) < S(O, 0) 


with corresponding conditions for a proper minimum. 


To prove the two theorems just stated we remark first that on 
account of the continuity of f(x, y) with respect to y the function 
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J(#,y) with constant # and with the assumption that y takes all 
values of the interval — x--- +. has for all these values a finite 
upper limit and a finite lower limit, and further that f(z, y) reaches 
these limits for values y = ¢,(7) and y = ¢, (z) (see § 8). 

Hence for values of y such that 


[3] |y|=|2#| it is clear that f(x, y)=/(a, 6,(2)). 


Furthermore, in virtue of the definition of a proper maximum of 
J(#, y) there must be a positive quantity 6 such that if only || 
and |y| are smaller than 6 we must have 


[4] S(& ¥) —f(0, 0) <0. 

It follows, if |z|<6 and «#0 and if we substitute y=,(«) in 
4], that 
- F(&, (0) —F(0, 9) <0, 


which is in fact the inequality [1]. 
Reciprocally from [1] and [3] are obtained the inequalities 


0<|z|<6 and /(@, y)—/(0, 0) <0, 
where ly|=|z]< 6. 


If the relation [4] is to be true for all systems of values (x, y) 
where || and |y| are smaller than 6 (excepting «= 0 and y= 0), 
then in addition to [1] we must have the corresponding pair of 
inequalities [2], which may be derived without trouble. 

We have corresponding conditions for wmproper extremes : 


THEOREM III. Jn order that f(0, 0) be an improper maximum 
of f(x, y) it is necessary and sufficient that there exist a positive 
quantity 8 such that ‘for any « with absolute value less than 6 
the value f(x, $y(#)) is not greater than f(0, 0) and for any y 
with absolute value less than & the value f(wy(y), y) ts not 
greater than f(0,0); while at the same time corresponding to 
every positive quantity 6! which is less than 6 there is at least 
one value of x or y whose absolute value is less than 6! and for 
which either f(x, b,(x)) or f(Wrg(y), y) ts equal to f(O, 0). The 


conditions for an improper minimum follow at once. 
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THEOREM IV. That /(0, 0) may not be a minimum (proper or 
improper) of f(z, y) it is necessary and sufficient that to every 
positive quantity 6 there either exists a quantity z’, with absolute 
value less than 6, such that 


[5] S(#', b1(2')) <fO, 0), 
or that there exist a quantity y', with absolute value less than 6, 
such that 

[6] Sii(y), ¥')<F(0, 0); 
and that 7(0, 0) may not be a maximum (proper or improper) 
of f(x, y) it is necessary and sufficient that corresponding to 
every positive quantity 6 there may be found either a quantity x’, 
with absolute value less than 6, such that 


[7] Se", ba(#')) > F(0; 0), 


or a quantity y’’, with absolute value less than 6, such that 


[8] Sb"), oy") >F(0, 0). 


CHAPTER IV 


THE SCHEEFFER THEORY 


I. GENERAL CRITERIA FOR A GREATEST AND A LEAST 
VALUE OF A FUNCTION OF TWO VARIABLES; IN PARTIC- 
ULAR THE EXTRAORDINARY EXTREMES 


30. The theorems of Stolz which were developed in the pre- 
ceding article are closely related to those of Scheeffer, which are 
of more practical value since the computations required have to 
do mostly with a few of the initial terms of the expansion of 
J («, yy —f(0, 0) in ascending positive integral powers of x and y, 
We shall assume that the function f(z, y) is such that it may 
be expanded by the Taylor-Lagrange theorem in the form 


S(@th, y+h) 
=f (x, y)+[hf) (2+ Oh, y+ Ok) + kf) (e+ Oh, y + Ok)] 
=f (2%, P+ MSL a Y+ EF) (@, + EMPL + Oh, y + Oh) 
+ 2hkft) (2+ Oh, y + Ok) + hfy, (@ + Oh, y + Ok)], etc., 
where 0<0<1. 


If we write <=0, y=0 and then put h=a, k=y, it is seen that 
[1] ST (2, Y) — FO, 9) = G, (@ Y) + Fn a (% Y)s 


where G',,(z, ¥) denotes the collectivity of terms of the 1 first 
dimensions and &,,,,(#, y) is the remainder term (Lagrange, 
Théorie des Fonctions, Vol. I, p. 40). 


The Scheeffer theorem. Jf an index n and positive quantities 

a and 8 can be determined to satisfy the two postulates (1) that for 

all values of « such that O<\x|<86 the upper and lower limits 

of |G, (& y)|Za|x|", with constant values of « and with y limited 

to the interval —x---+a, and (2) that for all values of y such 
45 
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that O<\|y|<6 the upper and lower limits of |G, (#, y)|=aly|”, 
where y has constant values and where « lies within the interval 
—y.-.+y, then the two functions f(x,y) and G,(%, y) have 
simultaneously on the position (0,0) either a proper maximum or 
a proper minimum. 

For, let the lower and upper limits of G,, (x, y), with constant x 
and with |y|=|z|, be G,, (7, B,(x)) and G, (xz, Py (x)) (see § 29); and 
with constant y and with |x|=|y| let the upper and lower limits 
of G, (x, y) be G,(Vo(y), y) and G,(V,(y), y). Since F,, .1(#, Y) 
is a homogeneous integral function of the (n+ 1)th dimension in 
x, y and consists of n+ 2 terms, we note that corresponding to 
any positive quantity e’ we may always find another positive 
quantity 6’ such that if 


|y|=|z| and 0<|a|<0, then |Z, ., (x, y)|< (n+ 2)e'|a||a|"; 
and also Se that if 

|z7|=|y| and 0<|y| <8’, then |&, .1(#, y)|<(m + 2)e'|y||y|". 

Hence writing (n + 2)! |x| =e and (n + 2)e'|y|= e, and denoting 


the corresponding value of 6’ by 6, it is seen that there is always 
an interval — 6.---+6 such that if 


[2] O0<|«|<6 and |y|=|z|, then |Z, ,,(a,y)|<e|a|"; 
and if 
[3] 0<|y|<8 and |a|=|y|, then |B, 44 9)|<ely!™ 


It follows then from [1] and [2] that for values of x, y such 
that |z|<6 and |y|=|2| we have 


[4] Gn(%, P, (w)) — e]a|”< f(x, y) — (0, 0) 
< Gy (x, By (x)) + €| |"; 
and from [1] and [3] that for values of a, y such that 0< ly|<6 
and |z|=|y| we have : 
[5] Gr (¥i(Y), ¥) — €|y|"<F(@, y) —£O, 9) 
<G,(Va(y), 9) + ely". 
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If next we assume that G,(0,0) is a proper extreme of 
G,(“, y) and that the two postulates of the theorem have been 
satisfied, then if G,(0, 0) is a minimum it is evident for small 
values of # and y that G, (a, ®,(z)) and G,(V,(y), y) are positive 
quantities, and from the postulates it follows that for values 


0<|#|<6 and |y|=|a| we have G, (, ®, (x))= a|a|” 
and for values 
0<|y|<6 and |2|= |y| we have G,(V, (y), y)= aly|". 
Accordingly it follows from [4] for values 
[6] 0<|z|<6 and |y|=|a| that (a —©)|z|"< f(a, y) —f(0, 0); 
and from [5] for values 


[7] 0<|y|<6 and |z|=|y| that (a — e)|y|"</f(@, y)— (0, 0). 


Since e may be made smaller than a, it follows in both [6] and 
[7] that f(z, y)—f(0, 0) is positive and consequently that /(0, 0) 
is a proper minimum of /(, y) (see Stolz’s second theorem, § 29). 

If G,, (0, 0) is a proper maximum of G,,(«, y), then with small 
values of « and y the expressions G', (7, ®,(z)) and G,(W4(y), y) 
must be negative. 

Hence, due to the postulates for values 


0<{a|<6 and |y|=|z|, we have G, (x, Py (~)) S— a|a|", 
and for values 
0<|y|<6 and |az|=|y|, we have G) (V5 (y), y) S— a|a|"5 


and in a similar manner as above it follows that /(0, 0) is a proper 
maximum of f(a, y). 

31. Stolz’s* added theorem. Jf G,,(0, 0) is not an extreme of 
G, (a, y), the following conditions are sufficient to make it impossible 
that f (0, 0) should be an extreme of f(x,y): Uf (1) for all positive 
values of « and y such that O<|a|<8 and 0<|y|<6, or for all 
negative values within the same limits, at least one of the two upper 
limits of G,(x, y) defined above is positive and not less than a|x|" or 


* Stolz, p, 218, 
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a\y|" respectively, and (2) for all positive values of x and y such 
that O<|a|<68 and O<|y|<6, or for all negative values within 
the same limits, at least one of the two lower limits of Gp(&, Y) 
defined above is negative and not greater than —a|\x\" or — aly|” 
respectively ; that is, if, under the restrictions just made, G, (#, ,(x)) 
is positive and G., (a, B,(x)) negative, or if G,(Vo(y), ¥) positive 
and G(Vy(y), y) negative. 


If we limit x, for example, to the interval 0... 6, and if we 
suppose that the following inequalities G(x, B,(x))=a|x|" and 
G, (a, B,(x)) S—a|a|" exist, it is seen that these two expressions 
vanish only for «= 0. 

From [1] and [2] it follows for y= ®,(«) and y= ®,(z) that 
for values of x within the interval in question 


f(z, B,(x))—f(0, 0)<—(a— |x|" 
and S(e, B,(x)) —f(0, 0) > (a — e) |x|”. 


Since we may take e<a, it is seen that in the two expressions 
just written, the difference on the left-hand side is in the first case 
negative and in the second case positive, so that f(0, 0) is not an 
extreme of f(x, y) (see Stolz’s fourth theorem, § 29). 

32. The analytic proof given in § 30 of the Scheeffer theorem 
is essentially due to Stolz. Owing to its importance we shall give 
Scheeffer’s statement of this theorem with his geometric deductions 
(Math. Ann., Vol. XXXYV, p. 553). 


The Scheeffer theorem otherwise stated. Let f(x, y) be any 
Junction as already defined of x,y which vanishes at the origin* 
and let its behavior in the neighborhood of this point be suffi- 
ciently explicit for the determination regarding the appearance 
of extremes by means of power series to be possible ; in other words, 
we assume that there exists a power ar” such that wpon every 
circle described ubout the origin as center, whose radius r is not 
smaller than a definite quantity g, the greatest and the least values 
of the function f(x, y), viz, f(r) and f,(r) for all points of the 


* If f (0, 0) #0, we must write f(x, y) —f (0, 0) in the place of f(a, y) in the present 
discussion. 
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circumference of the circle with radius 7, are in absolute value 
greater than ar". Then in the Taylor-Lagrange development 


given above 
J (&, y) a G,, (@, Y)+ hi, +4 (x, Y), 


where R,,,4(%, y) consists of all terms beyond those of the nth 
dimension, the integral rational function G,,(«, y) behaves in the 
neighborhood of the origin as does the function f(x, y). For, as 
we shall show, in the first place the greatest and the least values 
of both functions correspond with respect to sign for every small 
radius r, and from this it follows that there appear simultan- 
eously at the origin extremes for both functions, if such extremes 
exist ; and secondly, if a' is any quantity situated between O and a, 
then upon the circumference of every circle with radius r (within 
a certain limit g') the greatest and the least values of the function 
Gn(#, y) are in absolute value greater than alr", and from this 
it follows also that the degree of distinctness that marks the 
behavior of G(x, y) is the same as that of f(a, y). 


It is evident that we may replace # and y in the remainder 
term F,,,,(@, y) by 7, where 7 is the radius of the small circle 
about the origin within which the point (#, y) is situated; and 
at the same time we may replace all coefficients by their absolute 
values. In this way we have for the absolute value of #,, ,,(#, y) 


an upper limit 47+! We shall take the radius 7 smaller than a 
so that ar” > Arn*1, 

Since f,(r7) and f,(7) are by hypotheses greater in absolute 
value than ar”, it follows from the equation 


G,(, Y=S(% Y) — Ba si(@Y) 


that those values of xz, y on the periphery of the circle with 
radius 7 which give /,(r) and /,(7), cause G,(z, y) and f,(#, Y) 
to have the same sign. If 7,(7) and f,(r) have the same sign, it 
follows from the above expression that the greatest and least 
values of G,,(a, y) have this same sign. If the two quantities 
f(r) and f(r) have contrary signs, the same is true of G,(x, Y) 
for those values of x, y which produce f,(r) and /,(r); and 
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consequently for a greater reason the greatest and least values 
of G,,(z, y) have contrary signs. 
The second part of the theorem follows in the same way if 


we take the radius 7 not only smaller than i but also so small 


I 
io 
and 


that ar"— Arn+1>a'r"; that is, if we put g’ equal to 


take r less than g’. It is then evident that the values of a, y 
which produce f,(r) and f,(r) when written in the expression 


Gi, (@; Y) =f (2, 4) — Fi, 44 (4 Y) 


cause the right-hand side to be in absolute value greater than a’7” 
when /,(7) and f,(7) have the same sign; and when these two 
quantities have contrary signs the corresponding values’ of 
G,(#, y) will in absolute value be greater than a'r”, and the 
same must a fortiori be true of the greatest and the least 
values of G,(@, y). 

33. If, however, we cannot find an integer m and a quantity a’ 
which satisfy the conditions above, we can make no conclusions 
regarding the behavior of the function f(z, y) by means of powers 
series and by using the method indicated. For in this case we 
shall show by means of simple examples which follow this chap- 
ter that in some cases the function G,(z, y) is invariably positive, 
while /(z, y) may be also negative; and in some cases G,(a, y) 
may be both positive and negative while /(z, y) retains a con- 
stant sign (see Ex. 3, p. 61, and Prob. 2, p. 62). But if the 
conditions of Scheeffer’s theorem exist it is seen that the in- 
vestigation of the function f(x, y) has been reduced to that of 
the function G,(x, y); in other words, the investigation has 
resolved itself into the question: How can we recognize whether 
a limit g' and a quantity a! ewist such that wpon every circle 
with radius r<g! the greatest and the least values of a given 
integral function of the nth degree G(x, y) are in absolute value 
greater than a'r”? And how can we eventually fix the signs of 
these greatest and least values and thereby determine the extremes 
of the function G(x, y) ? 

These questions we shall now answer. 


THE SCHEEFFER THEORY 49 


II. HOMOGENEOUS FUNCTIONS 


34. In the expansion of f(#, y)— (0, 0) suppose that the first 
terms that appear form a homogeneous function of the nth degree 
in x and y which is the function G,,(x, y). With respect to such a 
function there are three cases to consider, according as this func- 
tion is a definite form, an indefinite form, or a semi-definite form 
(see § 13). If we write 


t=N , 

Gi, (2, y) => ajar ty, 
“ i=0 
it is seen that G(x, y) changes upon every straight line through 
the origin proportionally to the nth power of 7. If then G, and 
G, are the greatest and the least values of G,(#, y) upon the 
periphery of the. unit circle, then Gr” and Gyr” are the greatest 
and least values upon any arbitrary circle 7. 

The signs of G, and G, may be obtained ook through 
decomposing G',,(“, y) into its linear factors, which may be found 
by solving an equation of the nth degree. For we may write 


Gn (® Y) = mc, 1, d) = ag (u), where : cane 


and g(w) is an integral function of the nth degree in w. Owing to 
the fundamental theorem of algebra, g(uw) may be decomposed 
into factors which are linear and quadratic with negative dis- 
criminants if we restrict all the coefficients to real quantities; or 
these factors are all linear if we allow imaginary coefficients, the 
quadratic factors breaking up into two imaginary linear com- 
ponents. If these factors are multiplied by the respective powers 
of z, we have the corresponding decomposition of G(x, y) into 
its linear and quadratic factors. At the outset it is clear that 
if the degree of G,(“, y) is odd, then G, and G, must be equal 
but of opposite sign, since G,(#, y) changes sign when 2, y are 
changed into —z, —y. ‘Furthermore, note that G,(@w, oy) = 
o"G,(z, y), Where @ is a positive quantity. It follows that if 
G.,(a, y) is positive, negative, or zero, then G,(@x, wy) is positive, 


negative, or zero. 
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If G(x, y) is an indefinite form, there are values 2, y which 
give G,(x, y) a positive value, and other values , y which give 
it a negative value. Let 6 be a positive quantity however small. 
It is seen that by a proper choice of # we may find values of a, 
y where |x|<6 and |y|<6 such that G,(z, y) is positive, and other 
systems of values x, y within the same interval for which G, (2, y) 
is negative. Accordingly the value G,(0, 0) is not an extreme of 
Gn (ty Y)-™ 

If, however, 7 is even, and, first, if the linear factors of G, (2, y) 
are all imaginary, then G,,(x, y) cannot change sign nor vanish. It 
is a definite form and the quantities G, and G, have the same 
sign. If, secondly, there are real linear factors, and if at least one 
enters to an odd degree, then G,(a, y) takes both signs. G,(x, y) 
is then an indefinite form and the sign of G, is different from 
that of G,. If, thirdly, there enter real linear factors, but each 
only to an even degree, the form G,(z, y) may vanish but it 
cannot change sign. It is a semi-definite form, and one of the 
extremes G, and G, is zero. In this case by a proper choice of 
w above it is seen that G,(#,y) vanishes for values of 2, y other 
than zero and situated within the interval |z|<6 and |y|/<6. 
In this case G,(0, 0) is an improper extreme of G,,(z, y); and 
the behavior of f(z, y) at the origin cannot be recognized with- 
out further discussion. 

In all casest except the last a positive quantity a’ may be so 
determined that upon every arbitrary circle 7 the greatest and 
least values of the function G, (2, y), viz, G,r” and Gr”, are in 
absolute value greater than a'7”; for we need only take a! smaller 
than the absolute values of G, and G,. In these cases (again 
excepting the last) there are found in a sufficiently distinct 
manner (in the previous precise sense of the word, see § 27) either 
a maximum or a minimum of the function G(a, y), or there does 
not exist such an extreme. 

The decomposition of G(x, y) into its linear factors is not 
necessary, since we may determine the sign of G, and G, by 


* Cf. Stolz, p. 222. 
} The discussion is for the most part due to Scheeffer, loc. cit: 
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mene A dementary algebraic operations. For we may determine 
the saultigle factors of G,,(z, y) and write this function in the form 


Che, N=Vee Van VEN 
whene, in general, 4, is an irreducible factor of the kth degree in 
4 th y With tategre) coefiGents and r,, denotes the number of 
tae this factor occurs. Then by Sturm’s theorem we may deter- 
sine for cade such function ¥,(z, y) the number of real factors 
and by 2, the number of times such factor is repeated. 

The theory just outlined of the integral homogeneous functions 
Sess, owing to the Scheeffer theorem for the general theory of 
weexime 2048 minima of arbitrary functions, the following theorem : 

Lf ia the deedopment of the function f(z, y) in powers of x, y 
OD) terms of te first to the (nm —1)th dimensions are identically zero, 
while the terms of the uth dimension constitute a form G,,(z, ¥) 
homngeaeous ta 2 ond y, ond if, first, G, (#, y) & an indefinite 
Jorw (whith is ways the case if n is odd), then on the position 
0.0) Gere is wither 0 maximum nora minimum of the function 
SEs ¥, Lemdly, GZ, y) & & definite form, there enters accord- 
tag to the sign of this form an extreme of f(z, y); ¥f, finally, 
GMb, y) 2 semi-defuaite, the behavior of the function f(x, y) cannot 
be rengaiaed from the behavior of GZ, ¥). 

From this theorem it follows that if f(0, 0) is an extreme of 
SG), the terms of the first dimension of the expansion by 
Taylor's formula of f(x, y)—f (0, 9) must be wanting, and conse- 
quently we soust have 

SL0,9=9 and f/(0, 0)=0. 


14, furthermore, 
SL Y—-SO, 2) = AP + 2 Bay + C+ Elz, 9), 
thes (0, 0) is uot or is (im fact a proper) extreme of f(z, y) 
avoosting as AC — EP is negative or positive. If this discriminant 
is positive, then f(0, 0) is 2 maximum or a minimum according 
as A end C (which necessarily have one and the same sign) are 
negative or positive (see § 14). 
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But if 4C — B= 0, a criterion regarding an extreme of f(z, y) 
with the help only of the terms of the second dimension cannot be 
had. We must then take in addition terms of the third, fourth, - - - 
degrees in the above expansion of f(z, y) in order, if possible, to 
satisfy the postulates of Scheeffer regarding the function G’,(%, y), 
In this case we may write, if A is different from zero, 


P(e, ) —f(0, 0) = (Ae + By)? + Ble, + Rlo Yt ++ 


where 8, &,--- denote the collectivity of the terms respectively 
of the third, fourth, --- dimensions in 2, ¥. 
If in this expression we write «= Bt, y =— At, it is seen that 


f (Bt, — At)—f (0, 0) = Agf?+ Ayt+ ---, 


and if the constant A, is different from zero, it is seen that by 
giving positive and negative values to ¢, the above expression 
may take both positive and negative values, so that there is no 
extreme of f(x, y) on the position (0, 0). 

But even if the first term that appears on the right of the 
expansion in ¢ is of even degree, we cannot conclude that there 
is an extreme, as is illustrated by the classic example of Peano 
(see § 24), viz., f(a, y)= Ay?+ 2 Baty + Cat. 

Further investigation is therefore necessary when the terms of 
the second degree constitute a semi-definite form, and this case 
is continued in the following sections. 


Ill. EXTREMES OF THE FUNCTION G;,(z, y), INTEGRAL 
IN « AND y, WHICH IS NOT HOMOGENEOUS 


35. We must next determine whether or not the value G,,(0, 0) 
is an extreme of G,(x, y) when this function is not homogeneous 
in « and y and when the terms of the lowest dimension in Gi, (a, y) 
constitute a semi-definite form. We must again raise the question 
regarding the existence of an expression a'r” which for all suffi- 
ciently small values of 7 is to be smaller than the absolute values 
of the greatest value and of the smallest value of G,, (x, y) upon 
the periphery of a circle of radius 7, where 7 is sufficiently small. 
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In order, then, to acquaint ourselves with the different possi- 
bilities which may enter in the behavior of the function G,(@, ¥) 
at the point (0, 0), we take a small circle with radius 7 and seek 
upon it the two positions at which the function G,(2, y) takes 
its greatest and its least value. Call. these values the extreme 
values of G(x, y). They are found (see § 15) by solving the 


three equations ag, 
"—Ax= 0, 

Ox 
OG, NO! 

ey 
a+ y= 7. 


By eliminating from the first two of these equations we have 
an equation of the nth degree 
0Gn OG'n 
nye = 
Ox oy 


0, (2) 


an equation which is satisfied by all values of w and y which 
offer extreme values of G,(z, y) upon any arbitrary circle 7. 

It is known in the theory of algebraic functions that every 
branch of an algebraic curve of the nth order which contains the 
origin may be expressed in the neighborhood of the origin through 
an independent variable (#, say) in the form 

& = Ayk + Agk*® + +++) 

y= bk + bk + -- it 
and this expression for the curve may be made in any number of 
different ways such that in each of the series for v and y the first 
coefficient which is different from zero (in case there is one) has an 
exponent which is =x. It follows that both those branches which 
include the origin of the curve (7), and whose points of intersection 
with the circles of small radii offer the extreme values of G, (, y) 
upon these circles, may be expressed in the form (i) through an 
independent parameter /, so long, at least, as we remain in the 
immediate vicinity of the origin; that is, so long as very small 
‘values are ascribed to &. We shall call these two branches the | 
two extreme curves of the function G, (2, ¥). 


(w) 


54 THEORY OF MAXIMA AND MINIMA 


36. We must next distinguish between the cases (1) when 
(excepting for isolated values of 7) the extreme values of G,, (2, y) 
are both different from zero and (2) when one of these extremes 
iS zero. 

If both extremes are different from zero, then the expression 
G,,(«, y), if we write for « and y the two series (i) which corre- 
spond to an extreme curve, will begin with a term Ak”, which for 
small values of / determines both the sign and the order of magni- 
tude of the entire expression. This order is the mth order if we 


— i ; 
consider & a quantity of the first order, and it is of the ” th order if 
be 


we consider i" the first order, where k” is the smallest exponent 
that actually appears in (77). The number p, as we saw above, can 
at most be equal to n. We have similar quantities A’, m’, pw’ for 
the second extreme curve. If the two numbers m and m’ are not 
both even, there can be no maximum nor minimum of G, (x, y) at 
the origin, since this function in this case changes sign with / 
upon an extreme curve. The same is true if m and m’ are even 
numbers while 4 and 4’ have opposite signs, for then the function 
G,(, y) shows different signs upon the two extreme curves. 

If, finally, m and m’ are both even while 4 and 4’ have the same 
sign, then we have a maximum or minimum of G,,(z, y) according 
as this sign is negative or positive. 

In all three cases it is clear that a quantity a’ and an upper 
limit g’ of r may be so determined that for r<g’ the values of 
G,(#, y) upon both extreme curves are everywhere in absolute 
value greater than a/r?, where p is the greater of the two 
numbers ~~ and ae 

LL j 

If, however, the value of G,,(«, y) is invariably zero upon one of 
the extreme curves, there cannot be a maximum or minimum at 
the origin, nor is there an expression a/7? of the kind required 
above. But this can only occur when G,,(z, y) contains a squared 
factor which when put equal to 0 defines a real double curve 
that passes through the origin; for otherwise, with the vanish- 
ing of G(x, y) upon crossing the circumference of any circle with 
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radius 7, there must be a change of sign in G,(z, y). The squared 
factor enters as a factor to the first power in (di), so that points 
on this curve make G,(z, y) identically zero. 

In the sequel we shall assume that such factors have been di- 
vided out of G(x, y), so that the case in question does not enter. 

Under this assumption, which must be tested in every indi- 
vidual case, there exists, in virtue of the considerations already 
laid down, always a smallest number p associated with which a 
constant a’ and an upper limit g’ of the radius 7 may be so de- 
termined that upon every circle of radius 7<g’ the two extreme 
values of G(x, y) are in absolute value greater than a'r? ; and, in 
fact, this number p (if the order of 7 is taken as unity) expresses 
the degree of the magnitude of the function G(x, y) upon that one 
of the two extreme curves upon which this order is the highest. 
If p is at most equal to n, then a'r? for small values of 7 is not 
smaller than a'r”, and the two extreme values of G, (x, y) are there- 
fore certainly greater in absolute value than a'r”; but if p is greater 
than n, then for small values of 7 at least one of the extreme 
values of G,,(z, y) 18 in absolute value smaller than ar”, however 
the constant a@ may be chosen. 

It is thus seen that in virtue of the fundamental theorem the 
function G(x, y) may be used as a criterion for determining the 
existence of a maximum or minimum of the function /(z, y), 
where G,, (x, y) consists of the terms of the first to the nth order 
of f(x, y) only when the characteristic exponent p is at most 
equal to n. 

37. If in an example we wished to discuss the function G*, (2, y) 
in the manner indicated above, we must calculate the coefficients 
of (#), which, in general,.is a somewhat complicated operation. 
The following method leads, however, indirectly to the same 
result, viz., that of finding the extreme values of G,,(z, y), and thus 
offers an easy method for the criteria in question. The method 
in question is first to make use of the Stolzian theorems of § 29, 
and then by applying the Scheefferian theorem we may reach the 
desired conclusions. Accordingly we must determine the upper 
and lower limits of G,, (x, y) with constant 7 and |y|=|2| as well 
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as the upper and lower limits of this function with constant y 
and |z|=|y|. For brevity put G=G,(%, y). 

The values of y, viz, y=®,(z) and y=®,(z), which offer 
the first-mentioned pair of limits, fall either within the interval 
—g-+.--+ or upon one of the end-values y=—# or y=+&. 
When they fall within the interval, since G,,(z, y) is a continuous 
function which has a first derivative with respect to y, it is seen 


PV al 


oi: 


that y = ©, (x) and y = ®, (2) are solutions of the equation - 


2a 


In the second case, when they fall upon the end-points of the inter- 
val, then y = # or y=— & may offer the desired limit or limits. 

It is: permissible throughout the whole discussion to fix a posi- 
tive quantity a<1 as the upper limit for |z|, where @ is taken so 
small that y= ®,(z) and y= ®,(z) are convergent series in 2, 


G : : 
which when substituted in the equation es = 0 identically satisfy 
it. Furthermore (see § 29), since lim ®, (z) and lim ®, (x) = 0, it is 

x=0 x=0 
seen that no constant term can enter these expressions. 
The method of determining the different values of y which 


satisfy the equation — = 0 is found in §§ 139 et seq. Let these 
values be J 


R@), Be, Blas. (i 
38. We may next see which of these functions may be neglected 


from the investigation. If P(x) denotes any of the functions 
P(x) (t=1, 2,---) and if P(x) has the form 


(1) P(x)=axe{a+a7R(x)}, where p>0O and o>0, 


then to any arbitrarily chosen « >0 there corresponds a quantity 
6>0 such that there are values |z|<6 for which |ur-R (x)|<e; 
- and for such values of « we have 


(2) |P (x) | >|a|f{\a|—e. 
If p lies within the interval 0<p<1 and if |x| is further so 
diminished that |a|—e>|z|!-», then from (2) it is seen that 


|P(x)|>|«|; and consequently y= P(x) would fall without the 
fixed interval —...+-«, We see, therefore, that any series which 
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begins with a term az?+..., where 0<p<1, may be neglected 
from the number of functions given in (i). 

If, next, p=1 and |a|>1, we may take ¢ so small in (2) that 
|a|—e>1, and consequently |P()|>|z|, so that such series may 
also be neglected. 

Furthermore, if one of the series (7) begins with +1. or —1- 2, 
and if the second term has the same sign as the first, then evi- 
dently |P(«)|>|z|, and such a series may accordingly be neglected 
from the investigation. 

39. The remaining series in (7), together with the values which 


correspond to the end-points, viz, y=+a and y=—2, give, 
when substituted in G(x, y), the following functions: 
G(x, —2), Ga, +2), E@ BO), 4B), +5 @ 


and we have to determine which of these functions presents the 
upper and the lower limits of the function G (z, y) for the interval 
in question. 

By taking @(<1) sufficiently small the first term in any of 
the functions (i) is as a rule sufficient in determining which 
will give the required upper and lower limits. Of course, if two 
of the functions (7) have their initial terms the same, it may be 
necessary to introduce their second and higher terms to determine 
which furnish the required limits. 

Of those functions whose first terms are negative the one with 
smallest exponent gives the lowest limit; and if two series have 
the same negative exponent, the one with greater coefficient 
offers the lower limit. If there is no function in (#%) whose first 
term is negative, then in determining G(x, ®,(x)) we note that 
of those functions whose first terms are positive that one with 
highest exponent offers the lowest limit; while if two functions 
have first terms with the same exponent, the one with smaller 
coefficient offers the lower limit. These observations must be 
made with both positive and negative values of x, where |x|<a. 
If one of the functions in the series (i) is zero, while the others 
all begin with a positive term, then G(x, ®,(z))=0, etc. We 
proceed in the same way in determining G(x, ®,()). 
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40. To determine G(V,(y), y) and G(V,(y), y), taking y con- 
stant, we limit to the interval —y---+y. Denote by 
iY), GY), GY), + 
those values of x which expressed in power series in terms of 


eat 


0G 


satisfy the equation 


ay 
yp 
UL 


The two limits in question are to be found among the functions 


G(—y, y), GY Y), GAY), ¥), G(@2Y)s Y)s +++ 


the method of procedure being the same as above. 

When each of the four limits G(®,(z), x), etc. has been deter- 
mined for values of « within the fixed intervals, the Stolzian 
theorem is at once applicable. If the expansion, say, of G(x, ®,(z)) 
is a,x0*+ ay,4,0°*1+4+..- and if k=n, we may at once find a 
constant ¢ such that G(x, ®,(x)), = cla"; 


and if the same is true of the three other limits the Scheefferian 
theorem is at once applicable. 

41. Exceptional cases. If the function G(x, y) contains factors, 
say x+y, then G(x, *- x) identically vanishes. More generally 


the equations G(x, y)=0 and oe 0 may be satisfied by the 
ey 


same series y= P(x). In this case, considered as an integral func- 
tion in y and with arbitrary z, the function G(z, y) has a repeated 
factor, say Q(x, y), which vanishes for y= P(x). Next suppose 
that G(w, y) is decomposed into its irreducible factors H,(z, y), 
H, (x, y), +++, and give to # such a value x, that each of these 
functions is also irreducible when considered as a function of y. 
Furthermore, since by hypothesis G («,, y) = 0 contains a repeated 
root y=P(#,), it is seen that two of the functions H;(z,, y), 
H,(%, y),+++, say H, and H,, vanish for y=P(z,). And since 
by hypothesis these functions are both irreducible with regard 
to y, they are identical except as to a multiplicative factor which 
is independent of y. But as H,(z, y) and HT, (x, y) are identical 
in y for an indefinitely large number of values such as = ey, it 
follows that the coefficients of like powers of y in these two 
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functions are identical, so that G(x, y) is divisible at least by the 
Square of an integral function H(, y). 

If at least one of the four functions, say G(a, ®,(x)), vanishes 
for values of x other than z= 0 within the fixed intervals, while 
for all other values this function retains the same sign, and if the 
other three functions are invariably of this same sign, then G (0, 0) 
is an amproper extreme of G(a,y). It follows that as a necessary 
condition for G'(z, y) to have an improper extreme on the position 
x=0, y=0, G(a, y) must contain as factor the even power of 
an integral function H(x, y) which not only vanishes for a = 0, 
y= 0 but also for values x, y whose absolute values are arbitrarily 
small. For if, in accordance with the above remarks, G=H*G, 
where G(x, y) contains no root y= P(x) which is also contained 
in H(x, y), and if & is odd, then as y passes through the value 
y =P (x) the function H* changes sign and therefore has values 
with opposite sign. 

Example 1. Let f(z, y) = ay?+ 2 bay + cxt+ R, (2, y), where a > 0 and 
R,(2, y) denotes any series beginning with terms of the fifth order in x 
and y. 

Writing G (2, y) = ay? + 2 bx?y + ex‘, it is seen that for x constant and 


\y|=|2], ce = 2(ay + bz?) is zero only for y =— oa We thus have 
G (:, — ° 2°) aaa as 
a a 
and G (ay Z) = ax? + 2 ba® + cat. 
The first expression offers the lower limit, while either G(x, + 7) or 
G (a, — x) offers the upper limit. 

We have three cases to consider : 

(a) ac —b?<0. Then of the two limits one is positive and the other 
negative. It follows that G(0, 0) is not an extreme of G(x, z), and as both 
limits begin with powers of x not exceeding the fourth, the Scheeffer 
theorem is applicable, which shows that (0, 0) is not an extreme of f(z, y). 

(B) ac —?>0. It follows since a > 0 that ¢ must also be positive. The 
two limits just derived are both positive. Continuing we must next deter- 
mine the other two limits. When y is constant and |z|=|y|, we have by 


solving the equation ag ; 
UE vere lge ACs Pom 78) 
hn ee 


b 
the two values Pa) ghitsl | wpetce ad <u 
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If b £0 the latter value may be neglected (§ 38), since the exponent of y 
lies between 0 and 1. If } = 0 this value coincides with the first. 

We observe that each of the functions 

G(0,y)=ay? and G(+y,¥) = ay’ + 2 by’ + cy* 

is positive. It follows from Stolz’s theorem that G(0, 0) is a proper mini- 
mum of G(z, y); and since the power of z or y on the right-hand side of 
any of the four limits is not greater than 4, the Scheeffer theorem shows 
that f(0, 0) is a proper minimum of /(2, y). 

(y) ac—=0. From above G(s, _ a) = 0, while the other three 

a 

limits are all positive. In this case G(0, 0) is an improper minimum and 
the Scheeffer theorem is not applicable, so long as we regard RF, (2, y) as an 
arbitrary power series with initial term of the fifth or higher dimension. 
(Stolz, p. 235.) 

Example 2. f(x, y) = y+ (az? + 2 bry + cy*)y + Ry(@, y), (@ 4 0). 

We have here G(a, y) = y + (ax? + 2 bay + cy") y. 


Taking z constant and |y|=|z|, we find as a solution of 


oC = 0 =2y + art 2 bry + cy? + 2y (bx + cy) 
y 
yao att +++ = (2), say. 


Forming the functions 
2 
G (a, o(2)) =—- rua +» and G(e,4+2)=27+[2b4(atc)]23 


it is seen that the first furnishes the lower limit, while one of the last 
functions offers the upper limit. It is evident that with x taken sufficiently 
small these two limits have contrary signs, so that G(0, 0) is not an extreme 
of G(z, y). Furthermore, since the lower limit begins with a power of z 
greater than 3, the added theorem of § 81 is not applicable. 

Proceeding further and taking y constant and |x|=|y|, we have as a 
solution of 

a = 0 =2y(ax + by) (since y is taken constant) 


L=— " y, which cannot be considered (§ 38) 
unless |b|<|a|. Forming the functions 
ay O 
G(-2yy)="+ 5 Git y y= P+ (Gb 2b +c)y¥, 


it is seen that both the upper and lower limits are positive. It follows that 
the added theorem is not applicable. We cannot, therefore, make a negative 
assertion regarding the extremes of f(z, y). (Stolz.) 


a 
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Example 3. f(z, y)=y? + 2*y + 2t + R,(a, y). 
In this example we have G(a, y) = y? + 2% + xt. 
With x constant and |y|=||, we have as the solution of 


0G id 
—=0=2 2 =—_——. 
a Yr, ¥ 3 


We thus have the functions 


a 3 
G(s, — 5) a2 and Gita, +2)=2? + oeree 


With y constant and |x|=|y|, we have from 


SS = ay +428 =0, w= — 5. 


It follows at once that 


o(iaftsn)=Se and G(tyy)=y+-- 


The value G'(0, 0) is consequently a proper minimum of G(z, y), and as 
none of the above series has an initial term with exponent greater than 4, it 
follows from Scheeffer’s theorem that (0,0) is a proper minimum of f(z, y). 
Although there is a proper minimum for f(z, y) = y? + 2*y + «4, it may be 
shown that G(z, y)=7?+ 2*y has neither a maximum nor a minimum. 


(Scheeffer, loc. cit., p. 573.) 
Example 4. Peano’s classic example : 
S@ Y=G ¥) + K(x, y), 
where G=y? —(p? + @*) ay + p*q?xt. 
With x constant and |y|=|z|, we have 
= =2y—(p? + 7) 2? =0, 


2 2 
so that Y= ae ne 


Forming the functions 


2 2 4 
G(«,4 i w)=—F (ph— 99) G(x, tz)=X4-s, 


it is seen that the upper limit is positive, while the lower limit is negative. 
It follows that G(0, 0) is not an extreme of G(z, y); and as the initial terms 
on the right have exponents that are not greater than 4, it follows from 
the Scheeffer theorem that f(0, 0) is not an extreme of f(a, y). 
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Example 5. f(z, y)=G(«,y) + Ry3(@ ¥), 
where G(2, y) = xy! — 3 xty? + (a8y? — 3 zy’ + y®) —10 xy + 5 x, 


With x constant and |y|=|«|, we have from 


Oe = 4 xy? — 9 xty? + (2 28y — 212y® + 8y’ —10zr") =0, 
oy 
as a solution (see § 145), 


y = 2274+ 324+ --- = d(2), say- 
Forming the functions 
G(z, $(@))=—4a4---~and Gi, +2)=a5s-::, 


which (see again § 145) offer the upper and lower limits of G(z, y), it fol- 
lows from Stolz’s theorem and the Scheeffer theorem that neither G(s, y) 
nor f(«, y) has an extreme on the position z = 0, y= 0. (Scheeffer, loc. cit. 
p- 575.) 


PROBLEMS 
1. Show that /(0, 0) is a minimum of 
I (&, y) = y' + 28 — 108 zy — 2 + R, (a, y)- (Stolz.) 
2. Writing G(a, ¥) = 4? — 2a%y + 24 + 4, 
LQ Y= YP 2 wty + x8 + yt — 28, 


show that G(0, 0) is a minimum for the first function but that f(0, 0) 
is not a minimum for the second function. Write in the latter expression 
YS (Seneaiie) 


IV. THE METHOD OF VICTOR VON DANTSCHER 


42. Instead of considering the extremes upon the straight lines 
through the point P(x, y)) we may derive the criteria for maxima 
and minima in the neighborhood of the points on these lines on both 


sides of the point (a), yj) in the xy-plane. With Von Dantscher* 
let the straight lines through (a, y)) be denoted by 


(1) = &y+rp, Y=Yot HP, 


bh Yay be . 
so that ae or Y —Yo= 5 (= %), 


where » and w are real variables such that ’2+ w2=1 and where p 
is a real variable which may have both positive and negative values. 


*See Muth. Ann., Vol. XLII, p. 89. 
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For extremes of f(a, y) at the point R we must have 


J (y+ AP, Yo+ Kp) — f (Xo, Yo) = 0 ( 


in case of a ee 

proper or wmproper 
an case of a mini 

J (%y + AP, Yo+ Mp) — f (%; Yo) = 0 ( of sie 


proper or improper 
Jor all values of p of a certain interval 
— <9, 


while for values p= p or p= q the above difference not only vanishes 
but changes sign. 


The thesis of Von Dantscher may be stated as follows: “If the 
lower limit, 7 say, of p in the region \?+ yw?=1 is different from 
zero, then /(%), Yo) is @ Maximum or minimum for the surface- 
neighborhood of the point (), yj); but if the lower limit of p is 
zero, then on the position %), yp) there is neither a maximum nor 
a minimum of the function f(x, ¥).” 

The decision as to whether a maximum or minimum exists for 
a given function /(z, y) on a point “, Y) in whose neighborhood 
J («, y) can be developed in integral positive powers of #— 2)=h, 
Y — ¥)=*, and on which point the first partial derivatives with 
respect to « and y both vanish, is consequently reduced to the 
investigation as to whether the quantity p is different from zero 
or not. 

If in the supposed development the nth dimension is the first 
whose terms do not all vanish, we write 


(2) f(t h, ¥o+ *)—S (Ho Yo) 
=g(h, k)= (h, hk), +(h, h)n 41+ (h, k)n 49+ ae? (n = 2) 


where (h, ),, denotes the sum of the terms of the mth dimension 
in h and &, ete. 
If we write in this expression 


(3) h=rp, k=pp, M+ w=, 
we have 


(4) gh, b= prs Mn+ POs Mart ++ 1= p"P(ps  H)- 
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The factor p” may be omitted, since to the value p= 0 there 
corresponds the position h=0, k=0 itself. The quantity r is 
accordingly nothing other than the lower limit of the absolute 
values of the real roots of the equation 


(5) $(P3 ry H)=(Ay Bn tr Hn 4a + += 0: 


From this the following is at once evident: 

Case I. If (h, &), is a definite form (§ 13), that is, one which 
takes the value zero for the one and only pair of values h = 0, 
k= 0, which case can only enter when 7 is even, then (A, #), 
is different from zero for all values A, mw which are different 
from zero, and consequently |(A, “),| has a lower limit / 
which is different from zero. We may, consequently, for the 
region A2+ yw2=1, determine a positive quantity 7 such that 
for |p|<7 we have 


A, B), =I>|A, Pn +1P + (A, Bn +2P?+ Cae |. 


The equation (5) has therefore no root p whose absolute value is 
not greater than 7; the quantity 7 is therefore different from zero, 
and there is consequently a maximum or minimum according as 
(h, k), 1s a negative or positive form. 


Case II. If (h, hk), is an indefinite form (§ 13), that is, one 
which for real pairs of values (h, %) takes both positive and nega- 
tive values, then also (A, mu), is such a form. It is then easy to 
show that in this case the equation 


O=(A, HrtA, MnaiP te: 


in any interval as small as we please — e<p<e, has roots that 
are different from zero, and consequently 7 = 0 and also ST (o> Yo) 
is neither a maximum nor a minimum. 


43. Case III. We come finally to the semi-definite case 
(§ 13); that is, one where (h, &), vanishes for pairs of values 
h, k which are different from zero, but does not change sign. 
It contains necessarily real linear factors, and, in fact, each 
one to an even power. The number n is consequently even, 
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and it follows that (A, “), is necessarily also a semi-definite 
form, whose factors are, say, 


(6) kyh — hyk, kgh — hok, + + +, Fgh — Ink, 
so that (h, k), is of the form 
(hy kn = (kyh — hyh)?4 (hgh — hak)? 
me (Kh a Iiy,ke)? (h, Ki)» —3(y+ lg+++++ ln)? 


where /,, 1,,+++ lm are positive integers and (h, hk), oa+u4---4 ln) 
is a definite form or a constant. ; 

To each such linear factor k,h —h,k(o =1, 2,-+-+, m) of (h, k)n 
there corresponds a linear factor w,A — r,m of (A, “),, Where 


y i 
Xr Stall res bo pe Ea, r2+ w2—1, 


io. a ? ba oceee eee ae 

Vie + h2 Vh2 + he 
with arbitrary sign of Vh2+ k2, since this constant enters only 
to squared terms in (h, k),. If X, » approach a pair of values 
A» &, for which (A, “),, vanishes, then of the roots of the equation 


P(P3 A, KV =A, Bn + (As B)n+41P +---=0, 
one or several become indefinitely small. 

Of course we may exclude the case where all the quantities 
(A, #)n+»(Y21) simultaneously vanish; for then ¢(p; A, w)=0 
for every arbitrary small value of p, and consequently /(a), Yo) 
is neither a maximum nor a minimum. 

We have next to see whether among the roots of d(p; A, u)= 0, 
which become indefinitely small when (A, /),, becomes indefinitely 
small, there are veal roots or not. Jf no real roots appear, then 
r>0 and f(a, Yo) 8 & maximum if the semi-definite form (h, k)n, 
when it does not vanish, is negative, while it is a minimum tf (h, k)n 
1s positive. 

When there appear real roots the investigation may be carried 
out as follows: In order to consider the function ¢(p; A, /) in the 
neighborhood of the point A,, “,, we write 


(7) AN=A, FU, HH KATY, 


where wu and v are variable quantities. 
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Since A2+ w2=1 and a2 + 42=1, we must have 
w+ + 2rAw+2 u,v =0, 


where it is certain that one of the quantities A, or mw, is different 


from zero. 


If »,# 0 we have at once from the equation just written 


v=— we tv we —(2A,u + u?*), 


where the positive sign is taken with the root, since from (7) w and 
v vanish simultaneously. Further, noting the development 


(2X ee 


Vit Brn ti) = mS (— 17 ($) OE 


it is seen that 


xr 
(8) =k, ai oee 
Be ah ee 
and if X, + 0, 
Be 1 2 
9 (ps (Ey ee Wes are 
(9) Vane 


Writing these values in $(p; A, w), we have 


$a (u, p)=(r.+ aay ee :) 


io n 


Xr 
+ (re He *Eu—---) Pata oss 
Ke ntl 
and a(t, p)= (Xo BEV — ++, Hy +0) 
+( Be, wet) PE ES 
Ng n+1 


b 


which for sufficiently small values of |w| and |p| or of |v| and |p| 
are certainly convergent and may be arranged in powers of u 


and p or of v and p. 


Since (A,, Hz)n= 0, it is seen that ,(0, 0)=0; the case that 
¢,(0, p) vanishes identically may be excluded, as has already 


been remarked. 


If p? is the lowest power of p in $,(0, p), the equation $,(0, p) = 0 
has exactly p roots p, which become indefinitely small with w or v. 
We must next see whether there are real roots among these p roots. 
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If the equation $,(u, p)=0 has no real root p which becomes 
indefinitely small with w or y, then for any arbitrarily small posi- 
tive quantity ¢« a positive quantity 8 cannot be found so small 
that in the interval —e<p<e there is situated a root p of 
,(u, p)=0 or of $,(v, p) which is different from zero and which 
belongs to a value w or v in the interval —8<w<6 or —8<v<6. 
Hence there exist positive quantities 5 and e so small that the 
function ¢, which vanishes simultaneously with w and p or with 
v and p in the region 


—6<u<6 or —8<v<8, —e<p<e 


takes values that are different from zero on every position w, p 
or v, p which is different from 0, 0, and these values have neces- 
sarily the same sign. For if ¢,(u’, p')>0 and ¢,(w", p”’)< 0, 
then with a continuous passage from the position w!, p’ to 
the position w’’, p’, which both lie within the interior of the 
realm in question and which passage does not pass through the 
position 0, 0, there must be a position wu, py) at which ¢,(u, p) 
vanishes; but there are no such positions. It follows that 
¢,(0, 0) is itself a maximum or minimum provided the equa- 
tion ¢,(u, p)=0 has no real root which becomes simultaneously 
indefinitely small with uw or v. Inversely, it is also true that if 
¢,(0, 0) is a maximum or minimum of ¢,(w, p), the equation 
$,(u, p)=0 has no real root which becomes indefinitely small 
with w or v. 

If, on the other hand, the equation ¢,(u, p)=0 has real roots 
which become indefinitely small with w or v, then ¢,(0, 0) is 
neither a maximum nora minimum; and vice versa, if ¢,(0, 0) 
is not a maximum or minimum, then in every region as small 
as we wish —8<u<56 or —d<v<6, —e<p<e there are posi- 
tions uv, p or v, p which are different from zero and for which 
$,(u, p) or $,(v, p) are zero. 

Through the above consideration the criterion whether the 
equation ¢,=0 has or has not real roots which become indefi- 
nitely small with w or v is reduced to the investigation whether 
¢,(0, 0) is a maximum or minimum of ¢,(w, p) or $,(v, p) or not. 
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We.have, therefore, to apply the criteria of Cases I and II of 
§ 42; that is, to arrange ¢, in dimensions of w and p or of v and 
p and to see whether the terms of lowest dimension form a definite 
or indefinite form. 

This same process must be applied to each of the m real linear 
factors wp,’ —A,M(o =1, 2,---, m) that are different from one 
another (p. 65), it being evidently sufficient, since w and v become 
simultaneously indefinitely small, for those linear factors in which 
r, and mw, are both different from zero to consider only one of 
the functions $,(u, p) or $,(v, Pp). 

44, We have, then, the following rule for Case III: 

If the developments of the functions $,(u, p), b2(U, P),** +> 
dm(u, p) all begin with definite forms, then f(&), Yo) 18 & max- 
imum when the semi-definite form (h, k),=0, while it is a min- 
imum Uf (h, k)»= 0. 

If only one of the functions },(u, p) begins with an indefinite 
form, then f (%, Yo) 1s neither 4 maximum nor a minimum. 

The case remains undetermined if among all the functions 
,(u, p) none of them begins with an indefinite form, while one or 
several of them begin with a semi-definite form. 


In this case, for every such function the above process must 
be again applied. We do noé affirm that by using this method 
a determination may among all conditions be made; but Von 
Dantscher says “if the method, which has been developed to see 
whether a series g(h, k) which begins with a semi-definite form 
has or has not on the position h =0,4=0 a maximum or mini- 
mum, fails, the function g(h, %) contains an even power of a 
series P(h,k) which vanishes for real pairs of values h, & in every 
region arbitrarily small 0 <|h|< 8, 0<|k|<8&” (see § 41). 

Example 1. Peano’s classic example: 

g (hy k) = KP — (p? + g?) Wk + pgrht. 
We have here 
P(p3 Ay M) = Mw — (p? + G*) up + p?q?rép®. 
The semi-definite form y? has the linear factor p so that either 


A =1, w, =9, or A, =—1, py = 9. 
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The corresponding values of \ and p, are (see [7] and [9}) 
A=1—}v?—.---, n=, 
so that $1 (0 p) = 0 — (p? + Pup + p'gtp® + (0, p))g Hoe 


The terms of the second dimension in v’and p form an indefinite quad- 
ratic form, so that g(0, 0) is neither a maximum nor a minimum. 


Example 2. Let g(h, k) =— W2(h — k)2 + 2 hk — 5 WS + 3 18K 
+ WKS — 7 WR? + 6 W8ke — 10h + APH 
Duper Ae|) Bi ee 
We then have fa poees i 
(p53 Xs W)=— ANywP(A—p)? + (2AwE— 5 25+ 3 yt + MuF—T A y2+ 6 ASW) p 
+ (—10A8 + A2u8 + 3 AA i= 42) 9? once 


We have here to consider the three linear factors 
PyA—Ayw= A, ploA— App = py gh — Ag = A— pe 


It follows that 
i it 


A=0, m=l, AQ=—-L w=; Mis rye ea 5% 
To these values correspond the expressions : 
N= Oh pma=1l—tv—-..., 
mar, A=—14 fv%+- 
1 i 
Seige ag 
$, (4% p)=— wt 2up—4p?4+---, 
$,(v, p)=— v2 + 6up—10p? +---, 
$;(u, p) = — w+ gup— Bp*4+---. 


It is seen that all three of the functions ¢@ begin with definite quad- 
ratic forms. The semi-definite initial form is negative when it is not 
zero; and accordingly g(0,0) is a maximum. (Von Dantscher, Math. Ann., 
Vol. XLII, p. 100.) 


We thus have 


PROBLEMS 


1. Show that g(0, 0) is neither a maximum nor a minimum of the function 
g (h, &) = hPkA — 8 AS + 18K? — 3h + I — 100K + 51”. 

(See Ex. 5, p. 62.) 

2. Apply this method to Ex. 3, p. 61. 

3. Tf 2#=a—2?-7?+(ecosa+ysina)’, find maximum and mini- 
mum yalues of z and give the geometric interpretation. 

4, If 22=2av2?4+ y—2?+ y%, find maximum and minimum values of z; 
show that there are improper extremes and give geometric signification. 


5. Find minimum value of u, where u = (a? + ys. 
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V. FUNCTIONS OF THREE VARIABLES 
TREATMENT IN PARTICULAR OF THE SEMI-DEFINITE CASE 


45. The theorems and proofs given by Stolz and Scheeffer for 
functions of two variables may be extended at once to functions 
of three or more variables. For example, /(Zp, Yo, 29) 18 a proper 
maximum of f(z, y, 2) if a positive quantity 6 can be so deter- 
mined that for all systems of values &, 7, & whose absolute values 
are smaller than 6 (excepting § = 0 ==), we have 


S(®+ = Yo 0; % = 9) —F (%; Yo: Zo) <0. 
If the partial derivatives of /(«, y, z) have definite values at 
every position of a fixed realm #, the codrdinates xo, ¥, 2 of 


those positions (if any) in & which offer extremes of the function 
J (&, y, 2) must satisfy the equations 


lzekn? Lahn® [eel=* 


To apply the Stolzian theorem we observe, if we limit ourselves 
to a position z=0=y¥),=2, that the collectivity of positions 
x, y, for which ||, |y|, |z| are less than 6 are distributed into 
three kinds of realms :* 


(1) always with |~|<6, 2 constant, and 
ly|=|2|, [2]= 25 
(2) with y constant and |y|<68, where also 
ZlSlyb lel Slyls 
(3) with z constant and |z|<6 and 
7|Slel [yl = lI. 
To apply the Scheeffer theorem we must consider the difference 


I (2, Y, 2) == 10; 0, 0)= Gu (X, Y, 2) + oa (2 Y, 2), 
as in § 30. 


The case where G,,(z, y, 2) is a definite or indefinite form is 
treated fully in Chapter V. 


* Stolz, loc. cit., p. 237. 
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46. The case where G(x, y, 2) is a nonhomogeneous function 
in which the terms of the lowest dimension constitute a semi- 
definite form may be treated in a manner analogous to that 
given in §§ 37-41, as follows: 

We first determine the upper and lower limits of G(z, y, 2) 
with constant « and |y|=|2|, |z|=|«|. Geometrically interpreted, 
this realm constitutes a square whose center is the origin and 
whose sides are parallel to the y-axis and the zaxis, the length 
being 2|z]. ; 

The positions at which G(a, y, 2) reaches one of its limits may 
lie (1) on the vertices, or (2) on the sides, or (3) within the 
interior of the square. 

We have, consequently, to form the expressions corresponding 
to the four vertices 


G(x, £, 2), G(@, %, — x“), G(@%, — x, %), G(x, — %, — 2). (2) 


For points on the sides we have to solve for y the equations 


(2) OG (x, Y, 2) See OG (XL, Y, — &) rol 
oy oy 
and for z the equations 
OG (@, &, 2) _ OG (x, — X, 2) “is 0 
(B) rege a ae 0 and cada 7, aeeaiad 


Let the solutions of the equations (a) be 
Y=E@), y=R@) 
and let the solutions of (8) be 
2=Q1 (2), 2= 93 (%), 
Those functions P(«) and Q(«) which cause y and z to fall without 


the given square are to be neglected (cf. § 38). 
With the remaining functions we form the expressions 


G(a, Ra), 2),---; E(w 2, Q(2)) (ii) 
For the points within the square we have to determine y and z 


in terms of « from the equations 


OG (&, Yr @) _ qlee OG (x, Y, 2) 


= 0, 
(Y) By Ae 
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If we eliminate z from these two equations, we may express y as 
power series in ~ without constant term, say y= , (z), ¥ = $2 (2), 
.-+(§ 29). To each such power series for y, say y= (2), there 
corresponds one for z in terms of ~, say 2=A(zx), which two series 
written in the two equations (y) cause them to vanish identically. 
With these values of y and z we form the expressions 

G(x, Py(#), (2), (2, bal), Ao())- (wi) 
Among all the functions that are found in (7), (#), and (wt) we 
are now able to determine those two which offer the upper and 
lower limits of the function G(x, y, z) within the interval in 
question. These limits may be denoted by G(x, ®,(x), A,(2)) 
and G(x, B,(~), A,(2)). 

If, next, we take y constant and |x| =|y|, |z|=|y|, we may derive 
in a similar manner the upper and lower limits G(V,(y), y, M,(y)) 
and G(V,(y), y, M,(y)). Finally, with z constant and |«|=|z\, 
|y|=\|z|, we derive the upper and lower limits 


G(Na(2), Qo(2), 2) and G(Nj(z), Q4(), 2). 


The Stolzian and Scheefferian theorems are at once applicable 
to these six functions in three variables, the method of procedure 
being an evident generalization of these theorems for the functions 
in two variables. 


PROBLEMS 


1. Make the extension and generalization of Von Dantscher’s method 
to the treatment of functions in three variables. 


2. In the line of intersection of two given planes find the nearest point 
to the origin of codrdinates. 


a 


CHAPTER V 


MAXIMA AND MINIMA OF FUNCTIONS OF SEVERAL VARIABLES 
THAT ARE SUBJECTED TO NO SUBSIDIARY CONDITIONS: 


J. ORDINARY EXTREMES 


47. It will be presupposed in the following discussion, unless 
it is expressly stated to the contrary, that not only the quantities 
that appear as arguments of the functions but also the functions 
themselves are real, and that the functions, as soon as the vari- 
ables are limited to a definite continuous region, have within this 
region everywhere the character of one-valued regular functions. 
Regular functions are defined in the following manner: A fwne- 
tion f(x) is regular within certain fixed limits of x if the func- 
tion ts defined for all values of «x within these limits and if for 
every value a of « within these limits the development 


fatm=fatEsf@t Ss at-- 


as possible ; the serves must be convergent and must in reality (see 
§ 136), represent the values of the function within this neighborhood. 

In other words: A function f(x) is regular in the neighbor- 
hood of the position «=a if the function in this neighborhood 
has everywhere a definite value which changes in a continuous 
manner with x. (Cf. Weierstrass, Werke, Vol. II, p. 77.) 

A one-valued analytic function f (a1, %,+++; Ln) of several vari- 
ables behaves regularly on a definite position (#1 = G1, Uy= M,+°°; 
In= Ap) If in the neighborhood of this position we may express 
the function through a series of the form 


9 mast Vn 

Das DA? 2 28 yn (Ly — %y)"(L_— Ug)’? + + + (Ly— An)”, 
WNETE Vy, Vyg,+**, Vy are positive integers or zero, and where the 
coefficients A,, .,++ +s, are quantities that are independent of the 


variables. (Cf. Weierstrass, Werke, Vol. II, p. 164.) 
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The discussion is thus limited to such functions as are analytic 
structures of the nature described more in detail in §§ 130, 131. 
Only for such functions can we derive general theorems, since 
for other functions even the rules of the differential calculus 
are not applicable; in other words we shall consider only the 
ordinary extremes. 

The problem of finding those values of the argument of a 
function f(x) for which the function has a maximum or mini- 
mum value is not susceptible of a general solution, for, besides 
the cases of the extraordinary extremes of §§ 5-7, there are func- 
tions which, in spite of the fact that they may be defined through 
a simple series or through other algebraic expressions and which 
vary in a continuous manner, have an infinite number of maxima 
and minima within an interval which may be taken as small 
as we wish.* Such functions do not come under the present 
investigation. 

48. We say (see § 1) that a function f(x) of one variable has 
a proper maximum or a proper minimum at a definite position 
e=a vf the value of the function for «=a is respectively greater 
or less than wt is for all other values of x which are situated in 
a neighborhood |x —a\<6 as near as we wish to a. 

The analytical condition that f(x) shall have for the position 
c= 
a proper maximum, is expressed by /(«)—f (a) <0 


a proper minimum, is expressed by f(x)—f(a) >0 } for aa 


In the same way we say a function f(x, %,-+-+, %,) of n 
variables has at a definite position «= a, ty=Ay,+++, y= My 
a proper maximum or a proper minimum if the value of the 
function for %,= 41, Ly=Ay,-++, Ln= Gy 18 respectively greater 


* A function may have in an interval as small as we wish 


(1) an infinite number of discontinuities, 
(2) an infinite number of maxima and minima, 
and still be expressed through a Fourier series. 
See, for example, H. Hankel, Ueber die wnendlich oft oscillirenden und unstetigen 
Functionen (Tiibingen, 1870); Lipschitz, Credle, Vol. LXII, p. 296: P. du Bois- 


Reymond, Abh. der Bayer. Akad., Vol. XII, p. 8, and also same volume, Part II, 
Math.-Phys. Classe (1876). 
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or less than ut is for all other systems of values situated in a 
neighborhood 
2 |%—a|<8, (A=1, 2,---,n) 

as near as we wish to the first position; and the analytical 
condition that the function /(#,, %,-+-, #,) shall have at the 
Position %— 4), Ly Ap, +++, y= A, 


a proper maximum, 18 f(y, %q,++ +, Ly) —f (Ay Ag, +++, An) <0, 


a proper minimum, is f(%, Xy,+++, Ln) — f (a4, Aq, +++, Ay) > 0, 


for |z,—a,|< 6, (A=1, 2,---, ”), where the quantities 6, are 
arbitrarily small. Improper extremes take the place of the 
proper extremes above when we allow the equality sign to 
appear with the inequality sign, as in § 1. 

49. The problem which we have to consider in the theory 
of maxima and minima is, then, to find those positions at 
which a maximum or minimum really enters. 

We shall give a brief résumé of this problem for functions 
of one variable and then make its generalization for functions 
of several variables. 

If z,, #, are two values of «x situated sufficiently near each 
other within a given region, then the difference of the corre- 
sponding values of the function is expressible in the form: 


Ly) — f (& 
I ( 2) I ( D = f! (x, + 0 (0ty— ,)), 
PN iah 
where @ denotes a quantity situated between 0 and 1; or, if 
z, is written equal to # and 4,=a+h, 
[1] f(e+h)— f (2) = hf! (© + 8h). 
From this theorem may be derived Taylor’s theorem in the 
form,* 


2] fle@+h-f@=hf@)t ss +o 


if 
(n —1)! 


= hn-LeeoD (a) + + hm f (w + Oh). 
N: 


* See Jordan, Cours D’ Analyse, Vol. 1, §§ 249-250, 
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In the two formule last written, instead of +h write x 
and write a in the place of x; they then become 


(1°) f(a) — f(a) =(@ — a) f"(a +(e — a) 


and 
[27] f(x)— f(ay=" eC ote ay g(a) + ++ 


(% — 0 
(n — 


-E— Ope os eer Of (+8 (#7 — a). 

Since f(z) is a regular, and consequently continuous, function, 
the same is true of all its derivatives. If /’(a) is different 
from zero, then with small values of A=xz—a the value of 
f'(a+6h) is different from zero and has the same sign as /” (a). 

According to the choice of 4, which is arbitrary, the differ- 
ence f(«)— f(a) can be made to have one sign or the opposite 
sign, if f’(a) is either a positive quantity or a negative quantity. 
Hence the function f(z) can have neither a maximum nor a mini- 
mum value at the position «=a if f(a) =0. 

We therefore have the theorem: Laztremes of the function 
S(«) can only enter for those values of « for which f'(x) 
vanishes (see § 2). 

It may happen that for the roots of the equation /'(z)=0 
some of the following derivatives also vanish. If the nth deriva- 
tive is the first one that does not vanish for the root 2 =a, then 
from equation [2°] we have the formula 


F(a) —f(q=——™" (e+ 6 (@ — a), 


and with small values of h = «—a, owing to the continuity of 
f(z), the quantity /™(a+6h) will likewise be different from 
zero and will have the same sign as f™/(a). If, therefore, 7 is 
an odd integer, we may always bring it about, according as h is 
taken positive or negative, that the difference /(x)— f(a) with 
every value of 7 (a) has either one sign or the opposite sign; 
consequently the function f(z) will have at the position «=a 
neither a maximum nor a minimum value. 
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If, however, m is an even integer, then A” is always positive, 
whatever the choice of 4 may have been; consequently the 
difference f(x) — f(a) is positive or negative according as (a) 
is positive or negative. 

In the first case the function f(z) has a minimum value at 
the position «=a; in the latter case, a maximum. 

Taking this into consideration we have the following theorem 
for functions of one variable (§ 3 

Lixtremes of the function f(x) can only enter for the roots of 
the equation f' (x)= 0. If a is a root of this equation, then at the 
position «=a there is neither a maximum nor a minimum of the 
Jirst of the derivatives that does not vanish for this value is of 
an odd degree; if, however, the degree is even, then the function 
has a maximum value for the position x=a vf the derivative 
for x=a ws negatwe, a minimum if it is positive. 

50. To derive the analog for functions of several variables, we 
start again with the Taylor-Lagrange theorem * for such functions. 
This theorem may be derived by first writing in /(2,, %,+-++, 2) 

L =a, +U(%,—a,), (N=1, 2,-+-+, 2), 
where w is a quantity that varies between 0 and 1; we then apply 
to the function 


b(uy=f(a+ U (%y — My), Ag+ WU (hlg— Ay), ++ +5 Uy U(%_ — Gn)) 


Maclaurin’s theorem for functions of one variable, viz.: 


[3] HFT HOA THO + 


mea i aypoe hes ~ $(8u), 
and, finally, in this expression write w=1, as follows: 
For brevity denote by /,(#1, %,+++, %) the first derivative of 
SF (£1) Bq ** *> €n) With respect to a, and by fp, x,(%» %.°* +s Ln) 
the derivative of f(a, %,+++, %) With respect to %, and 2%, 
that is, @2f(00,, Bg, ++ +5 Ln) 


inn i obama = » etc. 
ky? yf 13,°"Q 9 ? n) City, OX, 


*See Lagrange, Théorie des Fonctions, p. 152. 
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It follows, then, that 
k=n 
¢' w= A fe(art W(@y— 4), +05 Uy U(E_ — tm) ) (pe — ay) }s 
p" (u) = 2 (Sis ig + U( U(Iy— Ay)y °° 


ky bs Gn +u (Ly me On) (Hp,— ay.,) (Xx, op. Oy.) }> 


pm -D(y) = Da, tier payee (0 U(z,— 4), +** 
ky, hay + ++, km — e : 
= ; On U(XL_— On) (Xx, 4) Ph (Lx, = a Geos 


pb) (Ou) = 2 Fito Ii (44 + Ou (x,— ay), +++, 


Key, kg, +++) Km On + Ou (x, — On) )(Le,— Ay.) a) (% — ay,,) }- 
Hence, from [3] we have 
u 
pee Gg, ++ ty On) = eae gy + + +) Un) (Zp— Ay)} 
k 


2 hk (44, gy ** +) Mn) (Ze, — V%,) (Le, — %,)} 


AF jaa On ay + Se. Ke, Mehr | km —1 (25 M9, cpap? an) 
) i ky, kg, ++ > Kn m—1 
. (Zi, — Ue.) +++ (Lim —1— Vem —a)} 
Soar > {Fee tgs «++, Fen (44 + OU (4 — Ay), +, 
Oy ass ciagsy 2 
le = Ay Ou (Lp — Gn) (2, — ay.) + 0 yy) } 


From this it follows, if we write «=1, that 


J (#1) X; aes +, Lp) — f (Gy, %, saa tn) = Zi Solty Go,+* Gy) (& — a,)} 


ur 21,4 a Keay ky (21s Lg, +) Gn) (Ly, — M,) (Le, — By,)} 


= 
FGDs ug Eh, tte Beal las = +9 Ma) ag ie) 
(he By, —1)} 
are , >; { Fi, hes vey en (+8 (4 — a), y+ 8 (x, — Ay), ++, 


ty Key y sy Kem 0 
On + 8 (x, — Gn) ) (2p, — Ay.) ei (2x, es O,)$- 
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51. We are not accustomed to Taylors theorem* in the form 
just given; to derive this theorem as it is usually given, observe 
that upon performing the indicated summations each of the in- 
dices k,, k,,---, independently the one from the other, takes all 
values ffom 1 to m, so that the Ath term in the development is 
a homogeneous function of the Ath degree in #,— a1, %,— p, 
+++, LZ, —G,. The general term of this homogeneous function may 
be written in the form 


— D- N+ (&1— ay)" (&_ — g)** + + + (&y,— Qn)"; 
where AptAgt e+ +AR=A, 
D is the definite differential quotient 

ee La, ** %) Ln) 


Ox MOxe2 .- + Ox, 


2 ol Xr eee x 
) ea NTS ot =f n) (Q4, Mo, 5.0 Gy An), 
4, Ao, 1 An 


and WV is the number of permutations of ’ elements of which ),, 
Ng: +++) An Lespectively are alike; that is, 


rn! 


i é 
Moen > ones 


Furthermore, writing 7,— a,=/,, we have, finally, 


[4] S(&p Weg sicies In) — f (24, Be ,** "5 An) 


=¥ { (Le Hing ts “)) hy } 

k=1 OX}, Ay, Ag, +**, On 
lL (72% Hoy * 5 ‘) a } 

5 > 020,02, eee 


AytAgt +++ +An=m— Ash}: a0 pe 


1 
+> tee ee Bort *'s ein 


Ay) Ags tty An 


AytaAgt srs +An=m 
+> {fortrct ot) (ay+ Oh, oe +, An+Ohn) 


Ay Agy tt An 


Ah A An 
h} the O06 ie 
aaa 


* Stolz (Grundziige der Differential und Integralrechnung, p. 247) ascribes this 
mode of expression to A. Mayer (see paper by him in the Lezpz. Ber. (1889), p. 128). 
The form as presented here is found in Weierstrass’s lectures delivered at least ten 
years before the Mayer paper. 
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This is the usual form of Taylor’s theorem for functions of several 
variables. In particular, when m =1 the above development is 
[5] S(@y %as ++ +s Un) —L (Ay Ue» * + +s An) 
k=n 
= >, { fig(@y+ Ohy, +++, In + Bh,) hy}. 
k=1 
The function /(#,,--+, %,) is regular and continuous, as are 
consequently all its derivatives. If, therefore, the first deriva- 
tives of f(a, &g,+++, Z,) are all, or in part, =0 for z,=4,---, 
n» then they will also be different from zero for 2;=a,+8@h,, 
+++, Ln = An + Oh», where the absolute values of h,,h,,-++, hy have 
been taken sufficiently small; these derivatives will also be of the 
same sign as they were for 2, = @,, L2= Uy, +++, Z,= Ay. If, now, we 
choose all the h’s zero with the exception of one, which may be 
taken either positive or negative, it is seen that when the corre- 
sponding derivative has either sign, we may always bring it about 
at pleasure that the difference 


S(&y cp) mee ea In) — f (Hy, Cant 33 Gn) 
is either a positive or a negative quantity, and consequently at 
the position @,, @,+++, a, no extreme value of the function is 
permissible. 
We therefore have the following theorem: 


Ly= 


Extremes of the function f(x, &g,+++, L,) can only enter for 
those systems of values of (a1, %y,+++, Lp) which at the same time 
satisfy the n equations (p. 17) 

0 
[6] f 0 bie 0 a 0. 


’ ’ pense 
0%, Ly, 


It may happen that for the common roots of the system of 
equations [6] still higher derivatives also vanish. In this case 
we can in general only say that if for a system of roots of the 
equations [6] all the derivatives of several of the next higher 
orders vanish, and if the first derivative which’ does not vanish 
for these values is of an odd order, the function, as may be 
shown by a method of reasoning similar to that above, has 
certainly no maximum or minimum value. 
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52. If, however, this derivative is of an even order, then in the 
present state of the theory of forms of the nth order in several 
variables there is no general criterion regarding the behavior of 
the function at the position in question. We therefore limit our- 
selves to the case where the derivatives of the second order of the 
function f(a, &,+++, &,) do not all vanish for the system of real 
TOOtS Ay, Az,+++, a, of the equations [6]. 

In this case we have a criterion in the formula 


[7] J (&, gs FP =f, Ali 2 Oy 
2 eee q 
= ul >{(- oT (&4; Xo, ? =a) hyhy } 
2 ds 02,02, Gi, + Og, +++ Ont Olin 


by which we may determine whether /(x,, 2, +++, x,) has an ex- 
treme value on the position a@,, @,-++-, @,, since we may determine 
whether the integral homogeneous function of the second degree, 


> {( x4, Xo) eae “)) hyh } 
00,02, y+ hy, +++, dnt On 


ASM 
in the n variables h,, hy, +++, h, is for arbitrary values of those 
variables invariably positive or invariably negative. 
Denote this function by f,, (4+ Ohy, +++, G+ Oh,). 
On account of their presupposed continuity the quantities 


(oo Xo» ek we 2) and (ah Ha, % 2%) a 
a,+ 6h, An + Olin a. On 


020%, 020%, 


with values of h,, ho,+-++, hy taken sufficiently small differ from 
each other as little as we wish and are of the same sign; * hence 
with small values of the h’s the functions 


> (eae hyh,, } 


Cy + ONy, +++, Ant Ohn 
Of (a1, ++ ee) 

; hyh 
pos 5; {eased Hea ie 
have always the same sign, and we may therefore confine our- 
selves to the investigation of the latter function. 


ce (OF Bry 2011 En) 
* If any of the quantities ean 
be 


it by €rau(@y, +++, An), which must of course be given the same sign as fay (a+ fy, «+ 
Ant Ohn), ru denoting an infinitesimally small quantity. 


ae becomes zero, we may replace 
ae? an 
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If it is found that through a suitable choice of hy, hg,-++, hp 


the expression 
+ {(L Zo, tes, =2)) Ayhy } 
cai 0”, 0%, nig el de 


can be made at pleasure either positive or negative, the same will 
be the case with the difference /(2, %, +++, %n)— (4, %,°* +, U%), 
and consequently /(a,, %,+++,%,) has on the position (a1, @,- - +, 4») 
no extreme value. 

We therefore have as a second condition for the existence of 
a maximum or a minimum of the function /(z,, %,--+-, Z,) on 
the position (a, @,+++, G) that in case the second derivatives 
of the function f(z, #,,+++, %,) do not all vanish at this position, 
the homogeneous quadratic form 


0x, 0x, 5 Gey - an 


Aye 


must be always negative or always positive for arbitrary values 
of hy, he, +++, hn. 
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53. The three kinds of quadratic forms, viz. definite, semi- 
definite, and indefinite, were defined in § 13. 

As we have already indicated in § 13, it is seen that if the homo- 
geneous function is an indefinite form, the function f(2,, Z, - - +, Z,) 
has neither a maximum nor a minimum upon the position (a,, a, 
+++, @,); for if the right-hand member of [7] is positive, say, for 
a definite system of values of the h’s, then in accordance with 
the definition of the indefinite quadratic forms we can find in 
the immediate neighborhood of the first system a second system 
of values of the h’s for which the right-hand side of the equa- 
tion [7] is negative; consequently, also, the difference 


J (%, Ho, eee ee In) — f (ay, Mo, i? te a) 


is negative, so that therefore no maximum or minimum is permis- 
sible for the position (a1, @,++-, d,). 


Ee 
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If, then, the second derivatives of the function F Cie C,) 
do not all vanish at the position (44, Gg, +++, dy), it follows, besides 
the equations [6], as a further condition for the existence of an 
extreme of the function /(a,, 2,,-++, 2) that the terms of the 
second dimension in [4] must be a definite quadratic form, if we 
exclude what we have called the semi-definite case. 

The question next arises: Under what conditions is in general 
a homogeneous quadratic form 


[8] (x, Hy, weeks Bn) => AnD ,k ys 
A, Me 


Ay, = Ayn 
a definite quadratic form ? 

54. Before we endeavor to answer this question we must yet 
consider some known properties of the homogeneous functions of 
the second degree. 

Suppose that in the function $(z,, %,---+, ”,), in the place of 
(@, &,+++, L,), homogeneous linear functions of these quantities 

hen 

[9] Yr= 2 Cnyu% (N=1, 2,---, 2) 

w=1 
are substituted, which are subjected to the condition that 
inversely the z’s may be linearly expressed in terms of the ’s, 
and consequently the determinant 


Crys CyQ5°*° > Sn 

Cor» “g9>°* 19 Con sh 
[10] ee DE Sixlag + Can =O. 

Cn 1» nas * °°» nm 


The function (z,, 2, +++, %p) is thereby transformed into 
[11] P (Ly, Ly, * 2 +5 Un)=W (Yrs Yos* + +s Yn): 


Owing to this substitution it may happen that W(y, Yo,++ +s Yn) 
does not contain one of the variables y, so that $(#,, @, +++, %,) 
is expressible as a function of less than ” variables. 
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To find the condition for this write 
12a 
[12] dp, = ; si =D Aut (A=1, 2,---,7). 


If y is independent of one of the y's, say Y, so that conse- 
quently oY = 0, then from the n equations 


[13] “Ta cB()- Si (mse) ORL Beem 


w=1 
Ow @ 
we may eliminate the fe , v , 
Ow ; Y, Ys 
OF - We thus have among the ¢’s an equation of the form 
Yn-1 
=n 


[14] hip, = 0, 


w=1 


where the &’s are constants. 
Owing to equations [12] this means that the determinant of 
the given quadratic form vanishes, that is, 


[15] > tA 1rAdg + ++ Ann = 9- 
We note here the following formulas: 
Dan tay +5 ty) =F Ant 
DRC NCE EE ALA Sgt 1p, 


A 
= a lees 
. A, XjZy, = > Aj, Xp, 3 


[16] 


and consequently 
[17] oS {Pa (yz) 2g, ++ +5 Lp) => {p, (a1, Way +++, Ly) ay}. 


There exists, further, the well-known Euler’s theorem for homo- 
geneous functions: 


[18] > {h, (%, Hares, Xn) ©} = p (x, tgp * 25 In). 


It is also easy to show reciprocally that if, as above, the equa- 
tion [15] is true, the function ¢ consists of less than » variables. 
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For if we assume that equation [15], or, what amounts to the 
same thing, an identical relation of the form [14] exists, and if 
we substitute in p(2,, %,-++, 2,) the quantities x, +¢k, in the — 
place of z,(A=1, 2,---, ) and develop with respect to powers 
of ¢, we then have 


b (2+ thy, %y+ thy, +++, B+ thy) 
a (x, X95 ie) Lp) = 2t > Fare, Xo» peur s In} 
= a) (Ay, Kos ees Kn) 


It follows, when we take into consideration the equations [14] 
and [18], since the equation [14] is true for every system of 
values (X,, +++, Zp), that 

(a+ thy, ied ate thn) ae p (#4; oes be) Tn)» 
Hence, if the equation [15] exists, or if the X’s satisfy the equa- 
tion [14] for every system of values (4, 2%, +++, Xp), then 
(x1, %,+++, Z,) Temains invariantive if x,+ tk, is written for 
“,, where ¢ is an arbitrary quantity. 

Consequently, it being presupposed that 4,=0, if ¢ is so 
determined that the argument x,+tk,=0, we have 

ny oe oe 24)= («4 — A a, ie Ax, os 

Lyi “ale, Ua ae Pile 4-2 a), 
where ¢ is expressed as a function of less than m variables. 
We therefore have the theorem 

The vanishing of the determinant >) +Ayy Aggy? +*Ann ts the 
necessary and sufficient condition that a homogeneous quadratic 
function (#1, +++; in) = > Ary a x, be expressible as a func- 
tion of n—1 variables. Ode 

55. We return to the question proposed at the end of § 53, 
and to have a definite case before us assume that the problem is: 
Determine the condition under which the function (2%, Lq, +++, %n) 
is invariably positive. The second case where $(2,, %, +++, 2n) 
is to be invariably negative is had at once if —@ is written in 
the place of ¢. 
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We shall first show, following a method due to Weierstrass,* 
that every homogeneous function of the second degree p(x, Xo, 

++, &,) may be expressed as an aggregate of squares of linear 
functions of the variables. 

56. In the proof of the above theorem it is assumed that 
f(x, Ly,+++, ,) cannot be expressed as a function of x —1 
variables; it follows, therefore, that the inequality 


[20] > +4nA4e eee Ane = 0 


is true and that therefore it is not possible to determine con- 
stants k, so that the equation Dugi= 0 exists identically. 
i=1 
If, then, y is a linear function of the z’s having the form 


(2a) Y = 6,2, + tg = = lady, 


and if g is a certain constant, then the expression ¢— gy?(= 4d, 
say), after the theorem proved above, can be expressed as a 
function of only n—1 variables if the constants k,, k,,---, ky 
may be so determined that 


Lids =Sith (¢,— gye,)}= 0 


or 
A=n a 


[22] > OX Se >, kye,= 0. 
ail wil 


From the assumption made regarding [20] it follows, on the 
one hand, that the inequality 


[23] > Faz 0 
A 


must exist. This is the only restriction placed upon the ¢’s. On 
the other hand, in virtue of the ” linear equations 


[24] >a io $y (A= L 2, P SFa sa) 2) 


* See also Lagrange, Misc. Taur., Vol. I (1759), p. 18, and Mécanique, Vol. Types: 
Gauss, Disq. Arithm., p. 271 ; Theoria Comb. Observ., p. 31, ete. 
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the quantities z,, ,-++-, x, may be expressed as linear functions 
of 1, $2,--+, b», and, consequently, by the substitution of these 
values of 2, %,+++, %, in [21] y takes the form 


v= 


[25] y= Dob 


where the e, are constants, which are composed of the constants 
A,, and ¢. 
But from equation [22] it follows that 


ee ya bby 
9 >) Tine, 


Such a representation of the ¢,, however, since we have to do 
with linear equations, can be effected only in one way. 


=n 
Hl Dy kuby A=N 
We therefore have y = — =Sieanhy 
Db ubn ee 
n=l 


from which it follows that 


B= 


k=! ge, DA (A=1, 2,.-., n). 
w= 


Through the substitution of these values in [22] it is seen that 


A=n 


A= 
dD exdr— IY >, xe = 0; 
A=1 


Ne 


consequently, owing to the relation [25], we have 


A=n 1 

[26] ae Op 
dents 

A=1 


This value of g may be expressed in a different form; for from 
[25] and [17] it follows that 


v= 


[26°] Y => 4b(%p ee 9) = Zn lev ly3 ++ +s On)- 
v=) = 
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Comparing this result with [21], we have 
[27] C= dh, (61, C9) fee ig Cn) (v=1, 2, et N), 


and consequently 
1 : 
[28] o> Fea eae re 


Dd era(er, €g3°* +s €n) 
A=1 
— 1 . 

p(y, ane en) 

Since the quantities ¢,, c,,+-+, ¢, are perfectly arbitrary except 
the one restriction expressed by the inequality [23], the quantities 
€15 €y,***) n are, in consequence of the equation [27], completely 
arbitrary with the one limitation resulting from [28], viz., the 
function @ cannot vanish for the system of values (é,, é),+++, €n); 
otherwise g would become infinite. 

57. Reciprocally, if the quantities ¢,, ¢,,---, €, are arbitrarily 
chosen, but with the restriction just mentioned, and if g is 
determined through [28], it may be proved that the expression 
$(=¢— gy"), where y has the form [25], may be expressed as 
a function of only n—1 variables. For, form the derivatives of 
this expression with respect to the different variables, and multiply 
each of the resulting quantities by the constants ¢,, ¢,---, @,. 
Adding these products and noting [26“] and [28], we have 


> aPa- IVD) exPr Cv» Cas 2 tes end = > er Pa— y- 
The expression on the right-hand side is zero from [25]. Hence 
m constants may be chosen in such a way that the sum of the 
products of these constants and the derivatives of the expression 
¢—gy* is identically zero, and also $(e,+++, e,)=0 (cf. [18}). 
58. Substitute 2,+ te, for x,(N=1, 2,---, n) in d; if one of 
these arguments is made equal to zero, we have, as in § 54, 


or, from [18], g 


p (a; Baseeey Lp) — gy" 
= € e € e 
tes bs pet = 
= 6(x, yy sss, Ly — KR? ay, 0, © yyy — Bt ay, ++ +, ay — ay) 
ey, er Cx ke 
or, if the new arguments are represented by a’,, z',---, a! 


p(x, Ba,*%%, Ly) e 3 gy = (2's, as, 2) a ,-1)- 


n—-1? 
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Employing the same method of procedure with be, ae, a! 1) 
as was done with ¢ (a, #,+++, Z,), we come finally to the func- 
tion of only one variable, which, being a homogeneous function of 
the second degree, is itself a square. Hence we have the given 
homogeneous function $(2,, %%,-+++, Zp) expressed as the sum of 
squares of linear homogeneous functions of the variables. If the 
coefficients of are real, as also the quantities ¢, the coefficients y 
are also real, and since the quantities e may with a single limi- 
tation be arbitrarily chosen, it follows that a transformation of 
such a kind that the result shall be a real one may be performed 
in an infinite number of ways.* 

59. If, now, the expression 

[29] P(X, %ys +++) Ln) = Yi + Goa + +++ + Inn 
is to be invariably positive for real values of the variables and 
equal to zero only when the variables themselves all vanish, then 
all the qualities 9,, 9,,+++, J, must be positive; for if this were 
not the case, but g,, say, were negative, then, since the y’s are, 
independently of one another, linear homogeneous functions of the 
a’s, we could so choose the 2’s that all the y’s except y, would 
vanish, and consequently, contrary to our assumption, $(%,, %., 
-++,%,) would be negative. Furthermore, none of the g’s can vanish ; 
for if g,, say, were zero, we might so choose a system of values 
21, +++, L,,in Which atleast not all the quantities 2, 7, +++, %p 
were zero, that all the y’s would vanish except y,, and consequently 
¢ could then be zero without the vanishing of all the variables 
Bay bag 0+ %> Lye 

Reciprocally, the condition of 9,, Ja,+++s Jn being all positive is 
also sufficient for ¢ to be invariably positive for real values of 
the variables, and for ¢ to be equal to zero only when all the 
variables vanish. 

60. In order to have, in as definite form as possible, the ex- 
pression of ¢ as a sum of squares, we shall give to the expression 
[26] for g still a third form. 


* See Burnside and Panton, Theory of Equations (1892), p. 430. In this connec- 
tion it is of interest to note the Theorem of Inertia of Sylvester, Coll. Math. Papers, 
Vol. I, pp. 380, 511. See also Hermite, Huwvres, Vol. I, p. 429. 
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In connection with [12] it follows from [27] that 
h= 
= > A,,@, (v=1,2,---, ). 
w=1 


Denote by A the determinant of these equations, which from [20] 
is not identically zero, that is, 


[30] A=) + AyAga + + Ann: 


We have as the solution of the preceding equation 


1" 0A 
Cae Cy (#=1, 2,---, n). 
4A, 


It follows from this in connection with [26] that 


A 
[31] = 


an expression in which the c’s are subject only to the one con- 
ition that 
dition tha oY ia 
p> aAy 
is not identically zero. 


61. It is shown next that we may separate from $(w,, %, 
-+, 2) the square of a single variable in such a way that the 
resulting function contains only » — 1 variables. 

For example, in order that the expression ¢ — gx? be expressed 
as a function of » —1 variables, we may choose for g the value 
[31], after we have written in this expression c.=0(X=1, 2, 
-++,2—1), while to ce, is given the value wnity. 

From this it is seen that 


whe’ 2 Bs 
ey eee 
OAnn, 


where A, is the determinant of the quadratic form $(a,, a, ---, 
%,_, 9). Of course this determinant must be different from zero. 
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Hence we may write 


A 
P(X; Lg, *, Ly) = A ate 4. $ (x, th Shoes oe 1)s 
1 


where 


TiO eal ! exe ey 5 C- 
(x4, hyo Oh) =$( = Ly 7 em 5 ee met not ns F 
En Cn Cn 


We may then proceed with ¢ just as has been done with ¢ by 
separating the square of x) _,, etc. 

After the separation of » squares from the original function @, 
we notice that the determinant of the resulting function in n — w 
variables is the same as the determinant of the function which 
results from the original function ¢ when we cause the yu last 
variables in it to vanish. If this determinant is denoted by 4,, 
we have the following expression for ¢: 


p(x, Co siemoks Ln) 
ve! A A 
phe oe een) ee ee 
TE BOD Ete ’ 


62. If now ¢ is to be invariably positive and equal to zero only 
when all the variables vanish, the coefficients on the right-hand 
side of the above expression must all be greater than zero. We 
therefore have the theorem 


In order that the quadratic form 
P(X; Ha, 22%, & = DArat pers Ay, =A, 


be a definite form and remain invariably positive, rt 18 necessary 
and sufficient that the quantities A,, A4g,-++,A,-1, Which are 
defined through the equation A) + Aj;A9***An—p,n—po 06 all 
positive and different from zero. If, on the other hand, the quad- 
ratic form is to remain invariably negative, then of the quantities 
An-p An—99** 1» Ay A, the first must be negative, and the following 
must be alternately positive and negative (see Stolz, Wiener Bericht, 
Vol. LVIII (1868), p. 1069). 
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Ul APPLICATION OF THE THEORY OF QUADRATIC 
FORMS TO THE PROBLEM OF MAXIMA AND MINIMA 
STATED IN §§ 47-51 


63. By establishing the criterion of the previous section the 
original investigation regarding the maxima and minima of the 
function / (<1, %, +++, Z,) is finished. The result established in § 57 
may in accordance with the definitions given in § 52 be expressed as 
follows: In order that an extreme of the function f(Z,, Hy, +++, Xp) 
may im reality enter on the position (dy, 4y,+++, Gn) which is deter- 
mined through the equations [6], it ws sufficient, if the second 
derivatives of the function do not all vanish at this position, that 
the aggregate of the terms of the second degree of the equation [4] 
be a definite* quadratic form ; if, however, the form vanishes for 
other values of the variables without changing sign (that is, is semi- 
definite), then a determination as to whether an extreme in reality 
exists is not effected in the manner indicated and requires further 
investigation, as is seen below. 

In virtue of the theorem stated in § 53, an extreme will enter 
for a system of real values of the equation [6] if the homogeneous 
function of the second degree 


x (OF (a, Ly, +*, 2) 
>{( 0x, 0%, Mah S 


Ay Mw Gy, Bg, >**5 
that is, if Dla w (a, Gy, +++, An) Ayh,}, 
Ay me 


is a definite quadratic form ; in other words (§ 62), there will be a 
minimum on the position (a1, dg,+++, A) of the quotients 
4 
RF, 
where P= D tS itoor'+*Sa-njn—p are all positive, a maxi- 
mum if they are all negative. In both cases the quotients must be 
different from zero. 


ieee Théorie des Fonctions, pp. 283, 286; see also Cauchy, Calc. différ., 
p- A 
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This last condition is only another form of what was said 
above, viz., that > Sri, toust not be a semi-definite form. 
For if, say, hs 


Fy=>, thirSoe us *Srnn= 0, 
then the summation > Jrulyh, being denoted by $(hy, hg, +++, hy); 
Ay B 
this equation would directly imply the existence of a relation of 


the form 
Db, les has +++) Fin) = 0, 


where the &, are constants which do not all simultaneously vanish. 
If, therefore, #,,, say, is different from zero, we may write 


and with the help of this relation we have from the equation 


A=n 


A=n-1 
P(hrys hhgs +5 In) =D) Pa (hay begs ++ 25 hn) rx= Pulm t+ DY) daha 
A=1 A=1 


the following relation 


Lice 


A=n-1 k 
Phy, ho, eee) hy) = = Pr (hy, ho, Se hy) (14 ite} 
Now in this expression the arbitrary quantities h may be so 


chosen that 


k 
zn (A=1, 2,+--, n), 


n 


and consequently the function (h,, h,-+-+,%,) would vanish 
without all the h’s becoming simultaneously zero. This case we 
cannot treat in its generality. 

Neglecting this case, it is seen that the problem of this chapter 
is completely treated; however, the conditions that a quadratic 
form shall be a definite one appear in a less symmetric form 
than we wish. It is due to the fact that we have given special 
preponderance to certain variables over the others. 

We shall consequently take up the same subject again in the 
next chapter. 


ee 
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64. The question is often regarding the greatest and the least 
values (the upper and lower limits) which a function may take 
when its variables vary in a given finite or infinite region. If 
this value corresponds to a system of values within the given 
region, then for this system the function will also be a maximum 
or a minimum in the sense derived above. 

For example, let it be required to distribute a positive number 
a into +1 summands, so that the product of the a,th power 
of the first, the ath power of the second, etc., and finally the 
@,,, power of the last summand will be a maximum.* 

The quantities @,, @,-+-,@,4, are to be positive numbers. 
Let 2, Lg +++, Ly, @—%,—%y,—+++—2, be the summands in 
question and write 

OT = 04120972 + + + Ig" (@ — Ly — La— 2 0 « — Ly) 41, 


We must then determine when U or, what is the same thing, 
its natural logarithm, has its greatest value. 

If we put the partial derivatives of log U equal to zero, we will 
have 


ClogU _@ ~ tis a 
0x, @ A—@—++-—a2, ” 
@logU _ a ae 0 
0x, Wg Oh By cen ee ee 
OlogU _ a, _ ae a 
OX, Ly &—Bj— <9 — te 


These equations may be written 


Oy ee Z 
a = ’ 
ay us) ay Bn +4 a+ a+ eS ous Qyet 
the last term being had through addition of the preceding 
proportions. 


If we call 2{%, 2,..., 2 the values of the variables which 
satisfy these equations, we have 


ay 


i * . Xn, 
; Bie si, 
a+ a 1 


210) = a, : 
Bye stat eae 


WO =a 


* Peano, § 137. 


Ne ee Se ESE een 
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The corresponding value of U is 
a Gn +1 


fee 2 41 


UO gtiteten4i s 
(@,+ ese Oe) oe at 1 


To recognize whether U, is the greatest of the values of U, we 
may show that U is in fact a maximum for the system of values 
a, .-++, ¢ and that this position lies on the interior of the realm 
of variability under consideration. For, let #,, 7,+-+-, %, be an- 
other system of positive values of the variables, for which also 
a@—,—+++—2, is positive, and substitute for the variables in 
log U the values 


af + u(a,— 2), .--, 2+ u(a,— 2), where 0<u<1. 


Since the partial derivatives of the first and second order of log UV 
are continuous for all these systems of values, we have through 
the Taylor development, observing that the first derivatives vanish 
on the position 7,..., 2, 


[ae (2 — a)? a (2 — 0)? 
log U = tog, — 3| HE OED. qo ee Z 
2 (Se (ee (2)? 
re aj OO esa J SN a ay 
(a—aP—...— 2p 


where «{?,..-, 2 are values of the variables of the form 
a) + O(a, —a),---, 29 + O(a, — 2), where 0<6<1. 


The expression within the brackets is positive and different 
from zero, since it is assumed that the system of values 2,, 2, 

.-, 2, do not coincide with #10), 20,--+-+, #0. It follows that 
log U < log U, or U < U,, so that U, is, in fact, the greatest 
value which U can assume. 

We note that U takes a smallest value, viz., zero, if one of the 
summands into which a is distributed, vanishes. If we allow the 
summands to take negative values, it no longer follows that U, 
is the greatest of the values UV. 


CHAPTER VI 


THEORY OF MAXIMA AND MINIMA OF FUNCTIONS OF SEVERAL 
VARIABLES THAT ARE SUBJECTED TO SUBSIDIARY CONDITIONS. 
RELATIVE MAXIMA AND MINIMA 


65. In the preceding investigations the variables z,, %,-++, 2, 
were completely independent of one another. 

We now propose the ‘problem: Among all systems of values 
(1, Hy, +++, Ly) find those which cause the function F (x1, Lg, + ++, Xp) 
to have maximum and minimum values and which at the same 
time satisfy the equations of conditions : 


LL] Sa(®y, %qr° +s Mm)=O (A=1, 2,---,m; m<n), 


where fy (Ly, Lq,+++, Ly) and F(x,+++, ,) are functions of the 
same character as f (21, &g,°+*, Lp) in § 47. 

66. The natural way to solve the problem is to express by means 
of equations [1] m of the variables in terms of the remaining 
m —m variables and write their values in F'(#,, £,++-, Z,). This 
function would then depend only upon the n — m variables which 
are independent of one another, and so the present problem 
would be reduced to the one of the preceding chapter. 

In general, this method of procedure cannot be readily per- 
formed, since it is not always possible by means of equations [1] 
to represent in reality m variables as functions of the n —m remain- 
ing variables. A more practicable method must therefore be sought. 

67. If (a1, a,+++, a,) is any system of values of the quantities 


#4, €y,+++, , Which satisfy the equations [1], then of the systems 


of values 
(Q =A, +hy, %=Agthy, +++, L,=A_,+ hy), 


in the neighborhood of (a,,---, a,), only those which satisfy 


the equations [1] may be considered; that is, we must have 


[2] Pr(Qyt hy, Gat hgy s+, Un + hy)=0 (A=1, 2,---, m). 
96 


ee 


Ol 
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Hence by Taylor’s theorem the h’s satisfy the equations 


=n 
[3] De (21, Ag, °++, A) hyy+[hy, hg, +++, hn] 2 =0 
a (V=1, 2,--+, m), 
where [/,, h,, ++ +, h,]? denotes the terms of the second and higher 
dimensions in the respective variables. 
68. It being assumed that at least one of the determinants 
of the mth order which can be produced by neglecting n — m 
columns from the system of m-n quantities 


Tip Sia ray ates 
[4] / Four S02» cate 3 am 


Tins aes eeu Unis 


is different from zero, then (see §§ 135 and 136) m of the quan- 
tities h may be expressed through the remaining n — m quantities 
(which may be denoted by fy, k,,+++, Ky») mm the form of power 
series as follows: 


[5] hy= (Ay, ky, oS Kn —m)+ (Ay, Ky, ea Pea ees ae 
(A= 12,- =, i), 


where the upper indices denote the dimensions of the terms with 
which they are associated. These series converge in the manner 
indicated in § 136; they satisfy identically the equations [2] and 
furnish, if the quantities h,, kj,--+, ky, m are taken sufficiently 
small, all values of the m quantities h which satisfy these 
equations. 

69. The condition that one of the determinants in the preced- 
ing article be different from zero is in general satisfied; there 
are, however, special cases where this is not the case. A geo- 
metrical interpretation will explain these exceptions. 

Let F and an equation of condition #=0 contain only three 
variables x,, %, and 23. 

The equation of condition f(x, %, %3)=0 represents then a 
surface upon which the point (2, 2, #3) is to lie and for which 
F(a, Wp, 5) is to have a maximum or minimum value. 
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The determinants of the first order in the development 
f (GA My, Ag+ hig, Mgt hg)—S (A %, 4s) 


with respect to powers of h,, ha, and h, cannot all be equal to zero; 
that is, all the terms of the first dimension cannot vanish, the 
single terms being these determinants; and this means that the 
surface f = 0 cannot have a singularity at the point in question. 

Take next two equations of condition 7, = 0 and /,=0 between 
three variables ,, z,, and 2. Considered together they represent 
a curve, and the condition that the corresponding determinants of 
the second order cannot all be zero means here that the curve at 
the point in question cannot have a singularity. 

70. If the values of the m quantities h, are substituted in the 
difference Flag ny, OR ae 
this expression then depends only upon the »—~m variables 
ky, kg, +++, ky—m, that are independent of one another and may 
consequently for sufficiently small values of these variables be 
developed in the form 


[6] FP (4, gy +++, Ly) —F (ay, lg, +**, An) 
p=n—m 


=D Oh + SD Coolie b 
p=1 po 


It was seen (§ 51) that, in order to have a maximum or minimum 


on the position (a, @), +--+, @»), it is necessary that the terms of 
the first dimension vanish, and consequently 
[7] C,=0 (p=1, 2,---,n—m). 


71. This condition may be easily expressed in another manner. 
We may obtain the quantities e if, in the development 


F(x, Los***5 Ln) — F(a, Ge; cms An) 
w=n 


1 
= DF ts dD Pub hyt ees, 
w=] A, ¥ 


we substitute in the terms of the first dimension the values of the 
m quantities from [5] and arrange the result according to the 


RELATIVE MAXIMA AND MINIMA 99 


quantities 4), k),---,,_,. In other words, the equations [7] ex- 
=n 


press the condition that Pe h, must vanish identically for all 


systems of values of the is ‘that satisfy the m equations [3] after 
they have been reduced to their linear terms. These are the 


m equations 
B=N 


[8] Pa 2A UN ee oy 


Now multiplying these m equations* respectively by m arbitrary 
quantities ¢,, é,---, ¢,, and adding the results to the equation 


Dhl =); 


we have the following equation : 
B= 
[9] DG Fin + CaS ou t+ ++ + mS mu) yt = 0. 
hw=1 


But the e’s may be so determined that those terms in this sum- 
mation drop out which contain the m quantities h, which are ex- 
pressed in [5] through the »—m other h’s; by causing these 
terms to vanish, a system of m linear equations is obtained, whose 
- determinant by hypothesis is different from zero. 

Since the terms which remain of equation [9] are multiplied 
by the completely arbitrary quantities /,, kj,---+, ky, _m, it is not 
possible for this equation to exist unless each of the single 
coefficients is equal to zero. 

Consequently we have as the first necessary condition for 
the appearance of a maximum or minimum the existence of 
the following system of n equations, 


Fit ey fy + oSoy + t+ + omSmu= 9 (wu =1, 2,+++, 2), 


in the sense that if m of these equations exist independently of 
one another, the remaining »—m of them must be identically 
satisfied through the substitution of the e’s which are derived 


*This method is due to Lagrange, Théorie des Fonctions, p. 268; see also Gauss 
(Theoria Comb. Observ. Supp. § 11). 
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from the first m equation, it being of course presupposed that 
the system of values (a,, @g,++-, 4m) has already been so chosen 
that the equations [1] are satisfied. 

Taking everything into consideration we may say: Jn order 


that the function F (x,, %,+++, Zp) have a maximum or mint- 
mum on any position (dy, %g,+++, 4p), it is necessary that the 
N+m equations 
oF 6 6 7 
4 ey aie rie 44, eS 0 
Ox, 0x, 0x, 0x, 
[10] (i= 1, 2,*+-, 2), 
CANCE Xp,°° 5 Ln) =) As, 2, da m) 


be satisfied by a system of real values of the n+m quantities 
Ly, Xq,° ++, Lny Cy, Co, 2%, Ome 

72. These deductions were made under the one assumption 
that at least one of the determinants of the mth order which 
can be formed out of the m - quantities [4] through the omission 
of n —m columns does not vanish. This condition was necessary 
both for the determination of the quantities h, which satisfy the 
equations [2], and also for the determination of the m factors ¢,, 
Core ty Om 

It may happen* that a maximum or minimum of the function 
F enters on the position (a1, a,-++,@,) even when the above 
condition is not satisfied. For if it is possible in any way to 
determine all systems of values of the h’s not exceeding certain 
limits that satisfy the equations [2], the equations [7] together 
with the equations [1] are sufficient in number to determine the 
M quantities a, M,,+++, Ay. 

When the above assumption is not satisfied, the equations [8] 
exist identically, and consequently the equations [3], which serve 
to determine the h’s, begin with terms of the second dimension. 
We may often in this case proceed advantageously by introducing 
in the place of the original variables a system of n — m new vari- 
ables so chosen that when they are substituted in the given 
equations of condition they identically satisfy them. 


* See Stolz, p. 257. 
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73. To make clear what has been said, the following example 
will be of service ; its general solution is given in the sequel (§ 91). 
Find the shortest line which can be drawn from a given point to 
a gwen surface. Upon the surface there are certain points of 
such a nature that the lines joining these points with the given 
point have the desired property and, besides, stand normal to the 
surface at these points. 

If by chance it happens that one of these points is a double . 
point (node) of the surface, so that at it we have /,=0, /,=0, 
J;= 0, then in reality for this point the terms of the first dimen- 
sion in the equations [2] drop out and we have the case just 
mentioned. 

If the surface is the right cone 

I(@, y, 2)=9 = 2+ P— 2, 
we may write 


C= LW, 
[11] y =v — v4, 
2= ur+ vr, 


The equation of the surface is identically satisfied, and it is easily 
seen that we may express the quantities h,, h., hg through two 
quantities 4, and &, independent of each other even in the case 
where the required point of the surface is the vertex of the cone, 
that is, the point z= 0=y =z, or w= 0 =7; and in fact in such 
a way that not only indefinitely small values of h,, hg, hg corre- 
spond to indefinitely small values of %,, %, but also that all 
systems of values hy, h,, hg are had which satisfy the equation 


S(ethy y¥t+he, 2+hg) = 0. 


The variables, however, must be given at one time real, at another 
time purely imaginary, values if the equations [11] are to repre- 
sent the entire surface of the cone; but in this manner the 
unavoidable trouble has taken such a direction that the proposed — 
problem falls into two similar parts, which may be treated in full 
after the methods of Chapter V. In other cases we may proceed 
in a like manner. The special problem will each time of itself 
offer the most propitious method of procedure. 
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74. We must now establish the criteria from which one can 
determine whether a maximum or minimum of F(x, %,---, 2,) 
really enters or not on a definite position (4, a,---, a»), Which 
has been determined in § 71 above. 

One might consider this superfluous, since in virtue of the eri- 
teria given in the previous chapter a maximum or minimum will 
certainly enter if the aggregate of terms of the second dimension 
in [6] is a definite quadratic form of the nature indicated. 

It is, however, desirable to determine the existence of a maxi- 
mum or minimum without having previously made the develop- 
ment of the function in the form [6]; for in order to obtain the 
coefficients C,, we must pay attention not only to the terms of 
the first dimension but also to the terms of the second dimension, 
when the values of [5] are substituted in the development of 


FF(21, La, ° ++, Lp) —F' (ay, Gig, ++ *, Gn) 


p=n 


= > Fal fe Yiahhet 
B= * bs 


75. The above difficulty may be avoided if we multiply by 
the quantities ¢,(«—=1, 2,---m) respectively each of the expres- 
sions [2] which vanish identically, add them thus multiplied to 
the above difference, and then develop the whole expression with 
respect to the powers of h. 

Owing to equation [9] terms of the first dimension can no 
longer appear in this development, and we have, if we write 


h=m 


[12] F+> f= @, 
a= 
[13] F(x, %,° te Xn) — F(a, Gy, eM) An) =G (xy, Vo; ra +, Xn) 
ae | 
iy G (a, Gare, On) = 9 2 Gadtdot o-. 


We have, accordingly, the homogeneous function of the seis 
degree 2, Crottoe of the formula [6] if we substitute in 2 Gurl 


the values [5] and consider only the terms of the first idoension 


uv, he, 
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in the process. If then the criteria of the preceding chapter are 
applied we can determine whether the function F possesses or 
not a maximum or minimum on the position (a,, d,---, a,). 

76. The definite conditions that have been thus derived are 
unsymmetric for a twofold reason: on the one hand because in 
the determination of the quantities h some of them have been 
given preference over the others, and on the other hand because 
those expressions by means of which it is to be decided whether 
the function of the second degree is continuously positive or con- 
tinuously negative have been formed in an unsymmetric manner 
from the coefficients of the function. 

It is therefore interesting to derive a criterion which is free 
from these faults and which also indicates in many cases how 
the results will turn out. With this in view let us return to the 
problem already treated in the preceding chapter and propose 
the following more general theorem in quadratic forms. 


I. THEORY OF HOMOGENEOUS QUADRATIC FORMS 


77. THEOREM. We have given a homogeneous function of the 


second degree 
[14] D(a, Aa0 + Mn)= DP Ante (Anu = Ayr) 


Aye 
in n variables, which are subjected to the linear homogeneous equa- 


tions of condition 
=n 

[15] A=>/4r,2,.= 0 (A =1, 2,---,m; m<n); 
w=1 


we are required to find the conditions under which > is invariably 
positive or invariably negative for all those systems of values of the 
variables which satisfy equations [15]. 


It is in every respect sufficient to solve this theorem with 
the limitation that the quantities 2 are subjected to the further 
condition 

[16] aetag+..-+af=1; 
for if of +af+.-- +22 =p? then (2) +2) aay +(@) ae 
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x. x, 
Furthermore, if z,,---, 2, satisfy [15], then, also, Ps = 


: ve ed 
satisfy these equations, while, since 4(3 pees, == 500% ++ +52n), 


the signs of the two quadratic forms are the same. 

It is, therefore, in every respect admissible to add the equation 
[16]. We have, however, thereby gained an essential advantage : 
for owing to the condition [16] none of the variables can lie 
without the interval —1...-+1; furthermore, since the function 
varies in a continuous manner, it must necessarily have an upper 
and a lower limit for these values of the variables 2, %,--+, Zp; 
that is, among all systems of values which satisfy the equations 
[15] and [16] there must necessarily be one* which gives an 
upper limit and one which gives a lower limit of ¢ (see § 8). 

We limit ourselves to the determination of the latter. By trial 
we can easily determine whether ¢ reaches its lower limit on the 
boundaries, that is, when one of the z’s = +1, while the others are 
all zero. If this lower limit is not reached on the boundaries, then 
¢ has a minimum value within the boundaries (cf. § 64). 

78. Through the addition of equation [16] the theorem of the 
preceding article is reduced to a problem in the theory of maxima 
and minima; for if the minimum value of $(a, %,+++, 2) is 
positive, @ is certainly a definite positive form. 

Consequently, if we write 


[17] G=$—e(Sw2-1) 1)+25 46 Oy 
A=1 


then, in order to find the position at which there is a minimum 
value of the function, we have to form the system of equations 


0G 
=—=9 = a 
on, (Vad oe » 2). 
aes, ad p=m ag 
This gives etek 2 ex, + ze rpm 0 =1,2,.+.,n), 
or, 
h=n p=m 
[18] SY Aye w— y+ De a,=0 (X=1,2,--+,n). 
w= p=1 


*Crelle’s Journal, Vol. LXXTI, p. 141; see also Serret, Calc. dif’. et int., pp. 17 et seq. 


a a 
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From the n +m +1 equations 


hen p=™m 


DY Artur + D 6A = ON A= 15 2,5%-, 2), 
w=l p=1 
=n 


[19] => Ont, = 0 (p == ik, 2, UH Se m), 


the n+m-+1 quantities x,, 2,---+, Zp, C1, €y° ++, Sm, € Tay be 
determined. Since we know a priori that a minimum value of 
the function ¢ in reality exists on one position, we are certain 
that this system of equations must determine at least one real 
system of values. 

Consequently the first n+ m linear homogeneous equations of 
[19] are consistent with one another and may be solved with 
respect to the unknown quantities 7,, %,+++, ny Cy) Cg***s lm} 
their determinant must therefore vanish, and we must have 


Ay, — @, Ajg, TAL) Ayyy***y Um 
Ag; Agg— @,+++, Agy igs 2 > Og 


[20] Ae= Anp Anges ee Ann — €, Bn? **s Inn | = 0. 
A> 449) se, 1 Upc oral!) 
*») Onn 0, a ee. 0 


The equation Ae=0 is clearly of the ~—mth degree in e. 
The minimum value of ¢ is necessarily contained among the 
roots of this equation; for if we multiply the equations [18] 
respectively by 21, %,---,, and add the results, we have 


[21] p(x, Xo **y Ln) = é, 


‘ it being presupposed that the system of values (21, %,+--+, Xp), 
together with the quantities ¢,, ¢,+++, m, satisfies the system of 
equations [19], which is only possible if ¢ is a root of the equa- 
tion Ae=0. Furthermore, among the systems of values « which 
satisfy the system of equations [19] that system is also to be 


106 THEORY OF MAXIMA AND MINIMA 


found which calls for the minimum, and since the value of the 
function which belongs to such a system of values is always a 
root of equation [20], it follows also that the required minimal 
value of @ must be contained among the roots of this equation. 

As already remarked, this minimal value must be positive if 
is to be continuously positive for the systems of values of the 
xs under consideration, and from this it follows that Ae must 
have only positive roots. For if one root of this equation was 
negative, then for this root we could determine a system of 
values 1, X9, +++, Lp, €4, €g)***, €m for which, as seen from [21], 
¢@ is likewise negative. 


Hence, in order that be continuously positive for all systems 
of values of the «’s which satisfy the equations [15], it is neces- 
sary and sufficient that the equation Ae = 0 have only positive roots.* 


The question next arises, When does the equation Ae=0 
have only positive roots? It may be answered in a completely 
rigorous manner by means of Sturm’s theorem;t but the inves- 
tigation is somewhat difficult; and the symmetry, which we 
especially wish to preserve, would be lost when we applied 
Sturm’s theorem. 

For develop the determinant according to powers of e as 
follows: 


[22] erm __ Bie et Bee ae (— ie Bee ee 0; 


then if all the roots of this equation are real and positive, the 
coefficients B must be all positive, and, reciprocally, if the roots 
of this equation are real and the B’s are all greater than 0, the 
roots of the equation Ae=0 are all positive. The form is then 
a definite quadratic form. The necessary and sufficient condition 
that the form be not a definite one is that e=0 be the smallest 
root of the equation above. 


* See Zajaczkowski, Annals of the Scientific Society of Cracow, Vol. XII (1867) : 
also Richelot, Astronom. Nachr., Vol. XLVI, p. 273. ; 4 ae 

+ Burnside and Panton, Theory of Equations, chap. ix; Hermite, Crelle, Vol. LII, 
p. 43; Serret, Algébre Sup., Vol. I (1866), p. 581; Kronecker, Berlin. Monatsbericht 
February, 1873. 
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79. We shall first show that all the roots of the equation 
Ae=0 are real for the case where no equations of conditions 
are present. (See J. Petzval, Haidinger’s Naturw. Abh. II (1848), 
p- 115.) 

Equation [20] reduces then to the form 


Ay,— ¢, Ajg, sti, Ain 


Ap, Ag,— @, «++, Aon 


[23] = 0, where 4), = Aya, 


WAGs A, ey oe, Nee e 


an equation which is called the equation of secular variations 
and plays an important role in many analytical investigations; 
for example, in the determination of the secular variations of 
the orbits of the planets, as well as in the determination of the 
principal axes of lines and surfaces of the second degree.* 

80. Weierstrass’s prooft, which is very simple, that all the roots 
of this equation are real, depends only upon the theorem that if 
the determinant of a system of 7 homogeneous equations vanishes, 
it is always possible to satisfy the equations through values of 
the unknown quantities that are not all equal to zero. 

Instead of the equation [16] we subject the variables to the 
somewhat more general equation 


V(X, Ly,%**) In) = i 


where y denotes a homogeneous function of the second degree, 
which is always positivef and is only equal to 0 when the 
variables themselves vanish. 


* In this connection the reader is referred to Laplace, Mém. de Paris, Vol. Il (1772), 
pp. 293-363; Euler, Mém. de Berlin (1749-1750) ; Theoria motus corp. sol., chap. v 
(1765); Lagrange, Mém. de Berlin (1778), p. 108; Poison et Hachette, Journ. de 
UVEcole Polytechn., Cah. XI (1802), p. 170; Kummer, Crelle, Vol. XXVI, p. 268; 
Jacobi, Crelle, Vol. XXX, p. 46; Christoffel, Credle, Vol. LXIII, p. 257; Bauer, Crelle, 
Vol. LXXI, p. 40; Borchardt, Liowv. Journ., Vol. XII, p. 30; Sylvester, Phil. Mag., 
Vol. II (1852), p. 138; Salmon, Modern Higher Algebra, Lesson VI; and see in par- 
ticular Edward Smith, Solution of the Equation of Secular Variation by a Method due 
to Hermite. (Dissertation, University of Virginia. 1917.) Numerous other references 
are given in the paper last mentioned. 

+ Weierstrass, Berlin. Monatsbericht, May 18, 1868. Cf. also Kronecker, Berlin. 
Monatsbericht (1874), p. 1. 

t Note the lemma of §§ 83, 84, and 85. 
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81. If we form the system of equations (see [12] of preceding 
chapter) 

[24] d,—ew,=0 (A=1, 2,--+, 0), 
then these equations may always be solved if their determinant 


vanishes. 
This determinant is exactly the same as that in [23] if we write 


V= a ag. 
A=1 
We assume that e = k + li, where i =v — 1, and that we have found 
2 = & Fmt (A=1, 25>», 2) 
as a system of values that satisfy the equations [24]. 
We must consequently have 
py (E+ nyt; E+ Not sor? + Nn?) 
an (k <5 la) (Ey + nyt, E+ No, ag car UEXD) = 0 
(A=1, 2,<5s, 7). 
Since the real and the imaginary parts of these equations must of 


themselves be zero, it follows, when we observe that ¢, and y, 
are linear functions of the variables, that 


$y (&, 2e eats é,)— key, (E,, ae 5 phone: | E,)+ IN, (1, Nas***> Nn) = 0, 
$y (ny, Tax 2 2) %s 5) kr, (4 Nort * > Mn) — INn, (E,, oe pa AS Ej= 0. 


82. Next multiply these equations respectively by », and &,, 
take the summation over them from 1 to n, and subtracting one 
of the resulting equations from the other, then, since (see [17] of 
the preceding chapter) 


DndalEr E, Bees En) = DE (my Na>°* +2 Nn)s 
Daa (Ev E., a} En). = DEM (m, Ag y'*) 5 Tels 


we have 


Saban UD) tn) + DE (Er Eas wieses £,) b= 0, 


or, 


[25] Link (, 1] 9 s\n Mn) + WE, Eo, sie E,)} = 0. 
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If it is possible to find systems of values of the quantities «,, 
%,+++, ©, Which satisfy the equation [24] under the assumption 
that e=k+1i, then these values must satisfy at the same time 
[25]; but since after our hypothesis the quantity within the 
brackets cannot vanish, it follows that / must be equal to zero; 
that is, every value of e for which the determinant vanishes, 
is real. 

Hence we have the theorem: 


In order that a quadratic form (a, %g,+++, pm) be invariably 
positive, it is necessary and sufficient that the development of the 
determinant [23] which admits of only real roots, when expanded 
im powers of e, viz. 


[26] e" — Bie 14 Boet-2— ... +(—1)"B, = 0, 


consist of n+1 terms and that these terms be alternately positive 
and negative. 

If the function is to be invariably negative, then the equation 
[26] must be complete and have continuation of sign. 


Thus for the case where the variables are subjected to no 
conditions we have derived the criteria as to whether or not a 
homogeneous quadratic form is a definite one directly from the 
coefficients of the function and in a form that is perfectly 
symmetric. : b 

83. Lemma. If a homogeneous function of the second degree 
apr (a1, Hy) **+, Zp) can become zero for any system of real values 
of the variables which are not all zero, then yw may be both 
positive and negative, it being presupposed that the determinant 
of yw is different from zero. 

Let the function y vanish for the system of values (&,, &,---, &,) 
and instead of, 2, %,--+, % write in w the arguments &,+ ¢,h, 
E,+ cok,--+, En+¢,k, where the c’s are indeterminate constants. 

Developing with respect to powers of & we have 


WE t+ ck, E+ Cok, Pa Cac Ck) 
Bang) b> catbe (E,, Es, = +) En) + Rb (ce, Cg, ++ +s Cn) (1) 
oil 
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By hypothesis the £’s are not all zero, and the determinant of + 
being different from zero, it follows that y.(#=1, 2,---, ”) can- 
not all be zero. 

Since, furthermore, ¢,(a#=1, 2,---,) are arbitrary constants, we 


a=N 


may so choose them that Scere (E,, &, +++, E,) is not equal to zero. 


Now by taking & siihciontie small we may cause the sign of 
the expression (i) to depend only upon the first term on the 
right-hand side of that expression. 

Hence, if we choose & positive or negative, we have systems 
of values (2, 2,+++, ,) which make y positive or negative. 

84. The determinant of the system of equations [24] is formed 
from the partial derivatives of 


(x, Ho, **5 Ln) — ew (£4, Has? * *, Ln)> 


that is, from 2(X1, a, +++, B_)— CWa(%, La, +> +, Lp) = 0 (3) 
(@=1,2,.< = =, /m), 
1é 1é : ; 
where ¢, and wW, denote i and pas respectively. If this 
determinant is equal to zero for a value of e, it follows that we 
can give to the variables z,, %,---, Z, values that are not all 


zero and in such a way that the » equations (i) exist. Let this 

value of e be e=k+M; then if /=0, it may be shown that 

the function ~ can have both positive and negative values. 
Denote the system of values (a, #,--+-, %,) which satisfy the 


equation (7) by te=Egting (@=1, 2,---,n); 


then, as in § 82, it may be proved that 
LiW(é,, fo, SN) E,)- ve (n,, Dowie ks Mn) |= 0. (vit) 


Since by hypothesis / is not zero, the equation (¢z) can only 
exist either when y(&,, &,---, &,) and W(n, No» * -+,; %) have 
opposite values (and then it is proved, what we wish to show, 
that wy can have both positive and negative values), or when 
the two values of the function are both zero (and then from 


what was seen in the preceding section can take both positive 
and negative values). 


» > 
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85. In this connection it is interesting to prove the following 
theorem: If the determinant formed from the partial derivatives 
of the homogeneous quadratic form (a4, Ly +++, Ly) is different 
Srom zero, and if among the infinite number of quadratic Sorms 

Ad (Hy) Xap ++ +5 Gp) MAP (dy, Loy +++; Ly) 
there 1s one definite quadratic form, the determinant formed from 
the partial derivatives of 

P (Hy, Ly, +++) Uy) — CW (Hy, Ly, +++, Lp) 


vanishes for only real values of e. 

The theorem will also be true if the determinant of ¢ (and 
not as assumed of w) is different from zero. 

Let 4,6 + 4, be a definite quadratic form, and write 


MP + MY = v (Hp Wy, +++, Ln). 


We shall further choose two constants \, and my in such a way 
that when we put 


oh + Low = b (24, Ho, ee Ln)s 


¢ is different from zero. 
We know from the previous article that the determinant 


formed from the equations 
bi tig=0 (@=1,2,..-, 2) 
can only vanish for real values of /.. The equations 
ba—kbe=0 (a@=1, 2,---, 2) (iv) 
may be written in the form 
(Xo — Aq) bat (Mo— Koy) Wa= 90 (=I, 2,--+, 0), 


kp 
ie == ll, 2,22 35 1). Vv 
or bem Eye (0 ) () 


If we eliminate 2, %,-++, %», from these equations, we must 
have the same determinant for their solution as from the equa- 


tions (iv). 
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Hencé every & which causes this last determinant to vanish 
must also cause the first determinant to vanish. But the k’s are 
all real. It follows that if we form from them the m expressions 


_ key Bo 
ah 


these quantities must also be real. 


Hence the determinant of the n equations 


ga— eha= 0 (a =1, 2,-++, 2) 
has always n real roots e. 


We may therefore say: Jf among all the quadratic forms 
which are contained in the form ° 


AG (ly) Lay ++ +5 Tn) + MAY (Ly, Lg, ++ +, Ly), 
there 1s one which can have only positive or only negative values, 
then the determinant of ¢ — ew will have only real roots, it being 
assumed that the determinant of } or of is not zero. 

The theorem in § 80 is accordingly proved in its greatest 
generality. 

86. The case where equations of condition are present may 
be easily reduced to the case already considered. The determi- 
nant [20] was the result of eliminating the quantities z,, ,--+, 2p, 
€1, €,***, €m from the n +m equations 

p=m 


[18] SA ert & G,=0 (A=1, 2,=.-,), 


BREN 


[15] 6,= = Sane, = 0 (o=1, 2,+5%, %). 


Since the result of the elimination is independent of the way 
in which it has been effected, we may first consider m of the 
quantities 2, say: 24, %,+++%,, expressed by means of the equa- 
tions [15] in terms of the remaining n—m of the 2's, which 
may be denoted by &, &,---, &,_ We thus have 


v=nr—-™Mm 


[27] %= Sin £, (w= 1, 2,-2+, am). 
v=1 
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Through the substitution of these values, let h(%, Ba,++ +, zn) 
be transformed into $(€,; &,-- -, €,_m) and the equation 


A=n 


Die! into W(&, &,---, E,_m)= 


The function y is invariably positive and is only equal to zero 
when the variables themselves all vanish. 
The equations [18] may be written in the form: 


10 p=™m 
Te i= Qr==1) 25.) 7)! 
Multiplying these equations respectively by 20 A=, 2,+- >, 2); 
and adding the results, then, since , 
Sees Ox, yi 00, 
Ka OX 0€, 0€, 
=z OL, _ op 
10x, 0&,  0€, 
Oe we Le en 
i x2, 
NES! coe: 2 0€, 2 


A=n p=™m 
Lee aL, é ee 0, _ 4 (v=1, 20-5, 
The last term of this equation drops out if we substitute in it the 
expressions [27], since’ the 8, expressed in the &s vanish identi- 
cally, and we have the equations 


[28] LD ene (v=1, 2,--+,m—). 


Now give v all values from 1 to » —_m, and we have a system of 
n—m linear homogeneous equations, from which we may eliminate 
the yet remaining &,, &,---, &,m- The result of this elimination 
is an equation in e and must give the same roots in eas [20]. The 
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equations [28] are, however, created in exactly the same manner 
as the equations [24]. If, then, Ae is the determinant of these 
equations, it follows that the roots of the equation Ae=0 are 
all real. 

87. As the solution of the theorem proposed in § 77 the final 
result is: 


In order that the homogeneous function of the second degree 
P(X; Ha," *5 Ln) =D) Ary trys 
As 
A), = Any 


be invariably positive for all systems of values of the quantities 

Ly, Lay +++, Ln, Which satisfy the m linear homogeneous equations 

of condition ened 

9= Zonta = G~ {p= 12 -= my: 
i 


ut is necessary and sufficient that the form of the equation [20], 
developed with respect to powers of e and which has only real 
roots, consist of n—m-+1 terms and that the signs associated 
with these terms be alternately positive and negative. There must, 
however, be only a continuation of sign if > is to be invariably 
negative. 


The above method was first discovered by Lagrange, who did 
not, however, sufficiently emphasize the reality of the roots of 
equation [20]. 


4 
Il. APPLICATION OF THE CRITERIA JUST FOUND TO THE 
PROBLEM OF THIS CHAPTER 


88. We have determined the exact conditions necessary for a 
homogeneous quadratic form to be definite for the case where the 
variables are to satisfy equations of condition and in a manner 
entirely symmetric in the coefficients of the given function 
together with those of the given equations of condition. 

At the same time with the solution of this problem, the 
problem of maxima and minima which we have ‘proposed in tl 
chapter is solved. 
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mg regard to the remarks made in $71 and § 74 we 
sang as a final result of our investigations the following theorem: 


THEOREM. If those positions are to be found on which a given 
regular function F(a, Ly,+++,%,) has a maximum or minimum 
value under the condition that the n variables Ly, By, + ++, Ly satisfy 
the m equations 

[@] IX, Mos - 39 ty) — 0 (K= 1, 2,- + 5); 
where f, are likewise regular functions, we write 


=m 
[d] PS ish Giga es 3), 
and seek the system of real values 
Kepeidas es ly C1, Con oy One 
which satisfy the n +m equations 
(26 
[e] 4 02 
A= 0 (M25 070) 
Tf (a1, Go,+ ++, Gn) ts such a system of values of 1, Xg,+++, Ln, 
then we develop the difference 
G (A+ hy, byt Ng, > + +, Gy+ Mn) — G (Oy, Gq, ++ +) Gy) 


with respect to powers of h, and have (since no terms of the first 
dimension can appear, owing to equations [c]) the following 


development : 


[a] Gay t+ hy, Got Ng, +++, Gy +My) — G (Ay, Uy, ++ +) Mn) 
1 
=5>) Guy (By, Gay * +) Mn) hh 
ByY 
We must next see whether the function 
[e] h(hy, hg ++; i= >, Guy hyly 
My Vv 


is invariably positive or invariably negative for all systems of 
values of the l’s which satisfy the m equations 


— (0 (A=1, 2,--+, 2), 


K=Nn 


al ae, PAGE Mg, *° *5 ay) h,= 0 (p =i, 2, aL) mM). 
p= 1 
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To do this we form the determinant 

Gy4— 4 Gio, i ae Gin Jip Tov? * ink 

Goi; Gog — 6, °° *s Gon Six a fb ae ez 

[9] Gav Gre» es; Can bia Tw rai SAS pe ’ 
Jie Jim oh ae, Oe OSS one 2. 0 


Inv Fete) ay ay 0, 0, --+,0 


and this determinant put equal to 0 is an equation of the m—n 
degree in e, which has only real roots. Developing the determinant 
with respect to powers of ¢, we have to see whether the develop- 
ment consists of »n—m+1 terms with alternately positive and 
negative sign or with only continuation of sign. 

If the first is the case, the function > is invariably positive, 
and the function F has on the position (a4, 4g,+++, &_) & minimum 
value; if, on the contrary, the latter is true, then is invariably 
negative, and F has on the position (a4, 4),+++,; Gp) &@ maximum 
value. 


This criterion fails, however, when ¢ vanishes identically, 
because the quantities G,, vanish for the position (a), dg, +--+, @,); 
and it also fails when the smallest or greatest root of Ae=0 is 
zero, since in this case we may always so choose the h’s that | 
¢ vanishes without the ’s being all identically zero (see § 83). : 
In the latter case the function (x) is an indefinite or a semi- 
definite form (§ 78). 

In both of these cases the development [d@] begins with terms 
of the third or higher dimensions, and for the same reason 
as that stated at the end of § 63 we cannot assert that in 
general a maximum or minimum will enter on the position 
(41, Gq, +++, Gp). 

90. We give next two geometrical examples illustrating the 
above principles. 


wo 
a a eee eee 


ProBLeM I. Determine the greatest and the smallest curvature 
at a regular point of a surface F(a, y, 2)= 0. 
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If at a regular point P of a plane curve we draw a tangent 
and from a neighboring point P’ on the curve we drop a perpen- 


dicular P’Q upon this tangent, then the value that 2 — ne saab 

Pp” As 
approaches, if we let P’ come indefinitely near P, is called the cwr- 
vature of the curve at the point P. If the curve is a circle with 


. ; 1 allot 
radius 7, the above ratio approaches — as a limiting value and is, 
T 


therefore, the same for all points of the circle. Now construct 
the osculating circle which passes 
through the two neighboring points P 
and P’ of the given curve. The arc of 
the circle PP’ may be put equal to the 
arc PP! of the curve, when P and P’ 
are taken very near each other, and 
consequently, if 7 is the radius of this 
circle, the curvature of the curve is 
determined through the formula 


2p Tel 


Et PP” 7 Fie. 11 

The quantity 7 is called the radius of curvature, and the cen- 
ter M of the circle which lies on the normal drawn to the curve 
at the point P is known as the center of curvature at the point P. 
The curvature is counted positive or negative according as the 
line P’Q, or, what amounts to the same thing, J/P has the same 
or opposite direction as that direction of the normal which has 
been chosen positive. 

If we have a given surface and if the normal at any regular 
point of this surface is drawn, then every plane drawn through 
this normal will cut the surface in a curve which has at the 
point P a definite tangent and a definite curvature in the sense 
given above. 

The curvature of this curve at the point P is called the curva- 
ture of the surface at the point P=(z, y, 2) in the direction of the 
tangent which is determined through the normal section in question. 
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Following the definitions given above it is easy to fix the 
analytic conception of the curvature of a surface and then to 
formulate the problem in an analytic manner. 

If P’=(z', y', z) is a neighboring point of P on the surface, 
the equation of the surface may be written in the form: 


2] 0=F(e'—2)+ Fy’ +h—2) 
+ 3 {Fy (2 — a)? + Byo(y! — ¥)? + Pag(z! — 2)? 
+ 2 Fy(a!— 2) (y' — y) + 2 Fag(y' — 9) (2 — 2) 
+ 2 F,,(z’ — 2)(a'— x)}+---, 


OF oF oF 
—= ——» Nii — F. ie Say 
where F, a 2 by ae 
er or er 
= 38 22 Oy? 35 a2 
OF or Or 


The. equation of the tangential plane at the point P is 
[3] FE —2)+ FL —y)+ #3 ($ —2)= 0, 


where &, 7, are the running codrdinates. 
Therefore, if we write for brevity 


[4] VF2+ F2+ F2=H, 


and take as the positive direction of the normal of the surface 
at the point P that direction for which H is positive, then the 
direction-cosines of this normal are 


Consequently the distance from P’ to the tangential plane is 
F FF, ig 
5 P' = nd iee9 Wil Sify Y 
(5] Q=F @!- a) +By'- n+ 2-2) 


The negative or positive sign is to be given to the expression 
on the right-hand side according as the length P’@ has the same 


ee eS aS ae ee, ee erm, 
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or opposite direction as that direction of the normal which has 
been chosen positive. 

In the first case, paying attention to [2], which has to be 
satisfied, since P’ lies upon the surface, we have 


Dees 
[6] ey 
PP! 
& Fy (2! — 2) + Fooly! —y)? + Foa(e! — 2)? +2 Fyo(x'— 2) (y'—y) + ++: ‘ 
HS? 
where S? = (a! — 2)? +(y! — y)?+ (2 — 2). 


In the case where the direction P’Q is contrary to the positive 
direction of the normal, we must give the negative sign to the 
right-hand side of [6]. 

Now let P’ approach nearer and nearer P; then the quantities 

a—-ax yy 2-2 


’ 


S Saaey 2S 


which represent the direction-cosines of the line PP’, become the 
direction-cosines of the tangent at the point P of the normal sec- 


tion that is determined through P’. Representing these by a, 8, y 
ee) 


and the limiting value of 2 ——s by «, then 
Ie 


1 
(We “= i {P02 + Fy P+ Fogy? + 2 Fy@B + 2 Foghy + 2 F317}, 


where the terms of higher degree in «'— a, etc. are neglected. In 
this formula « represents the curvature of the surface in the direc- 
tion determined by a, 8, y. This is to be taken positive or negative 
according as the direction of the length MP, where M is the center 
of curvature, corresponds to the positive direction or not. 

If the codrdinates of the center of curvature are represented 
by Zp, Yo % and the radius of curvature by p, then 


or, since k=-> 


120 THEORY OF MAXIMA AND MINIMA 


BY ye a ee 

0 Pye? + Fog BP + +++ + 2 yyy 
Fy 

8] 9" aw PR 
%— hy fs 


Fyy@+ Py Bet +++ +2 Faye 


Since H does not appear in these expressions, we see that the 
position of the center of curvature is independent of the choice 
of the direction of the normal. 

Suppose that the normal plane which is determined through 
the direction a, 8, y is turned about the normal until it returns 
to its original position. Then, while a, 8, y vary in a definite 
manner, the function « of a, 8, y assumes different values at 
every instance, and since it is a regular function, it must have 
a maximum value for a definite system of values (a, 8, y) and 
likewise also a minimum value for another definite system of 
values (a, B, y). © 


Pera | 
The quantity ;e has. the same value for all normal sections 


that are laid through the same normal.* We have, therefore, 
to seek the systems of values (a, 8, y) for which the expression 


Fy 0+ Fog B82 + Fosy2+ 2 FyoaB + 2 Fy38y + 2 sya 


assumes its greatest and its smallest value. 
We have also to observe that the variables a, 8, y must satisfy 
the equations of condition 


{ Fia+F,8+ Fay =0, 


[9] 2 2 
OPA nae 1 


the first of which says that the direction which is determined 
through a, 8, y is to lie in the tangential plane of the surface 
at the point P, while the second equation is the well-known 


relation among the direction-cosines of a straight line in space. 


*See Salmon, 4 Treatise on the Analytic Geometry of Three Dimensions (Fourth 
Edition), p. 259. : 


ee ee ——— ee ee — - 


— a 
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Following the methods indicated in § 89, we write 
[10] G=Fya?+ Fy B2+ .-. +2 Fy ya 
—e(a?+ B+ P—1)4+ 2e(Fat+ Fi B+ FY), 
and we then have (§ 89, [c]) to form the equations 


Gs Eye aan ieee 


pee 8 ort ree ot 
Pia+F, 8+ Fey = 0, 


0, 


from which we must eliminate a, 8, y, and e’. 
These equations are 
(Fy — a+ FB + Hisy 2 + ie = 0, 
Pyy% + (Fo.—¢) B+ Fos + Fe’ = 0, 


[11] oe 
Ft +F 8 + (P53 — €)y + Fee’ = 0, 
ia +f,8 + Fey =i 

where FyY=F,, (A, #=1, 2, 3). 
Through elimination we have 
By—% Fy, Fis, fy, 

[12] Fy, Fyg— 6 Fags fy 0" 
Fy, P39, Fyg3—e, Ms 
F;, Fi, F, 0 


This is an equation of the second degree in e, and consequently 
gives us two values ¢, and e,, which are maximum and minimum 
values, since both maximum and minimum values enter, as 
shown above. Multiplying the first three equations [11] by a, £, 
y respectively and adding the results, we have 


[13] Fy? + Fy BP + Fygy?+ 2 Fygth+ 2 FysBy + 2 Fay ya =e. 
Hence, from [7] we have 


1 
[14] ea =. 
per 
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Consequently the two principal curvatures at the point P have 


the values 1 e 
che 
[15] Leu 

. Pe A 


and the coordinates of the corresponding centers of curvature are 
found from the formule 


F, Fy _ 
B= Op ea) eee Ol aaa oe © ee eae ae 
e e ey 
16 
oe Pies RE Ware a ede Ek 
Oy ee 2 net eae 
i) 2 2 


In order to determine e, let us write 
Dy = (Foq— &) (F'33— &) — Fos*s 
Dyy= Fog Fig — Fp (P33— &), ; 
and form from these the corresponding quantities through the 
cyclic interchange of the indices. Equation [12] may be written 
in the form* 
Dy F} + Dyy¥} + Dog F 2 + 2 Dyy FF y+ 2 Dog Py Fg + 2 Dy Fy P= 0. 
Developing this expression with respect to powers of e, we have 
[17] H*?— Le+M= 0, 
where L= H?(Fyy+ Fo + Fys)— (Fy FY + Foo FY + F333) 
+2 FoF Pot 2 Fog Fo l'gt+ 2 FoF, 
and = M= (Fy F'y3— Fog) FP + (Pag Py — Fy?) FP 
+ (Fy Fo — Fy) FF + Fie Fis — Fos Fy) FoF: 
+ (Pos Foy — Lyi Po) Fa Py + (P31 P59 — Fa P35) FP o- 
From [17] we have at once the values of the sum and the 
product of the two principal curvatures, viz. (see equation [15]): 


dj seaene 
~ 8" 
[18] Py P2 
ha, 
Pip, H* 


* See Salmon, loc. cit., p. 257. 


— 
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We have thus expressed the sum of the reciprocal radii of curva- 
ture and also the measure of curvature of the surface at the point P 
directly through the codrdinates of this point. 

Although the formule are somewhat complicated, they are 
used extensively and with great advantage. 

In the case of minimal surfaces,* which are characterized 
through the equation 

Pit Pa= 0, 

we have = 0: 


This is therefore the general differential equation for minimal 
surfaces. 


91. Prope II. From a given point (a,b, c) to a given surface 
F(a, y, 2)=0 draw a straight line whose length 1s a maximum 
or a minimum. 

Write G=(x —a)*+(y—b)?+(2¢—¢)?+2AF (a, y, 2). (2) 
Then the quantities x, y, z, % are to be determined (see § 89, [c]) 
from the following equations : 

e—-a+rAF,=0, 
y—b+AF,= 0, 
z—c+AF,=0, 

(x, y,2)= 0. 


(22) 


It follows, since F,, /,, F, are proportional to the direction- 
cosines of the normal to the surface at the point (a, y, 2), that the 
points determined through these equations are such that lines 
joining them to the point (a, 6, c) stand normal to the surface. 

If P=(z, y, 2) is such a point, then to determine whether for 
this point the quantity 

(w — a+ (y — bP + (2 — 6? 
is in reality a maximum or a minimum, we substitute x+ u, 


y+, 2+w instead of x, y, 2 in the function G. The quantities 
u, v, w are, of course, taken very small. 


*See papers by the author on this subject in the first numbers of the Mathematical 
Review. 
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We must develop the difference 
G(a+u, y+, 2+w)—G(a, y, 2) (212) 


in powers of w, v, and w. 
The terms of the first dimension drop out, and the aggregate 
of the terms of the second dimension is 


p= urt+ + wPt A(Fyyu? + Fogv? + Fu" 
+ 2 F,.uv + 2 Fyvw + 2 Fy,wu). (ww) 


Since the point («+u, y+v, z+w) must also lie upon the surface, 
the quantities w, v, w must satisfy the condition 


Fyu+Fy + Fw = 0, (v) 


where the terms of the higher dimensions are omitted (see [8] of 
the present chapter). as 

If we wish to determine whether the function y is invariably 
positive or invariably negative for all systems of values (wu, v, w) 
which satisfy equation (v), we may seek the minimum or maximum 
of this function yy under the condition that the variables are limited, 
besides the equation (v), to the further restriction (cf. [16] of § 77) 
that 


u2t yt y2—1= 0. (v7) 
For this purpose we form the function 
y — e(ut+ + w—1)4 2¢ (Fut Fv + Fw), (vit) 


and writing its partial derivatives with respect to w, v, and w equal 
to zero, we derive the equations 


é 
(#,:- 


Fyut+ (Fa. : 


at ! 
"y )u + Fyyv + Fw +5 Fy= 0, 


—1 
)e aS (ere ei “= 0, 4 (vitt) 


Fy + Fyo0 + (Fy_— wo F= 9. 


J 


i 
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Spires e! ; me 
Eliminating wu, v, w, ~ from equations (v) and (viti), we have here 


exactly the same system of equations as in [12] of the preceding 
problem, except that here — and e’ stand in the place of e¢ and e’. 


Denote the two roots of the quadratic equation in e, which is 
the result of the above elimination, by e, and e,, and the corre- 
sponding radii of curvature of the normal sections by p, and p,; 


: e—1 : : : 
then, since re has the same meaning as ¢ in the previous problem, 


aL q-1l 1 
Px Xr vou 
ee see 
Pe n 


where the positive direction of the normal to the surface is so 
chosen that H> 0. 

If for the position (z, y, z) a minimum of the distance is to 
enter, then both values of the e must be positive; if a maximum, 
then ¢, and e, must be negative. 

It is easy to give a geometric interpretation of iis result : 
Let PN be the positive direction of the normal and A = (a, 6, ¢). 
Then from (7) it follows that the length from A to P has the 
same or opposite direction as PN, according as » is negative 
or positive. . 


Hence, from (7), 
AP=—)2)H. 


If the centers of curvature corresponding to p, and p, be denoted 
by UM, and &,, then 


P 
‘BE 
fy ee 
MA M,A 
Hence ¢=—— and ¢=—*.. 
1 UP 2 M,P 
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If, then, M@, and M, lie on the same side of P and if 4 lies be- 
tween M, and M,, as in Figs. 12 and 13, then the ¢’s have different 
signs and there is neither a maxi- 
mum nor a minimum. 

If M, and M, lie on the same side 
of P while 4 is without the inter _. PP M A Mm 
val M,--- M,, then a minimum or Fic. 13 
maximum will enter according as 4 
starting from. one of the*cebters Ol =~ = <=, see 
curvature lies upon the same side as 
P or not (see Figs. 14 and 15). A _M MM. P 

If the points M@, and M, lie on Fic. 15 
different sides of P and if A is situ- 
ated within the interval M7, --- I, 
as in Fig. 16, then there is always 
a minimum. If, however, A lies without the interval W, --- M,, 
then there is neither a maximum nor a minimum. 

In whatever manner M, and M, may lie, if A coincides with 
one of these points, then one of the two values of e is equal to 
zero, and the general remark stated at the end of § 89 is applicable. 

The above results are derived in a different manner by Goursat, 
Cours D’ Analyse, Vol. I, p. 118. 

The case may also happen here (see § 72) that in the solution 
of the equations (7) and (#2) a singular point of the surface is 
found at the point P, at which 7, =0=F,=F;. We cannot pro- 
ceed as above, since, there being no definite normal of the surface 
at such a point, the determination whether for this point a maxi- 
mum or minimum really exist cannot be decided in the manner 
we have just given. 

The general remark of § 73 indicates how we are to proceed. 

92. Brand’s problems. The two following problems taken from 
the theory of light were prepared by my colleague, Professor 
Louis Brand. 


ProsLeM I. Reflection at the surface F(x, y, 2)=0. A ray passes 


Srom a point F to a point P on a given surface and is reflected to 
a point . When is BP+PE a minimum ? e3 


M, A M, Pp 
Fie. 12 


MP SAMs 
Fie. 16 
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Write PR=d, and PR=d, so that 


ds=V (a; — a+ (yi — y+ @;— 2)? (@=1, 2). 
We seek to find the condition that makes d,+d, an extreme 
when P is subjected to the condition of lying on the surface 
[1] F(%, y, 2)= 0. P, P, 
Using the Lagrangian method (§ 89) wc - 
must find the extremes of the function 


f(z, y, 2)=d,+ d,+rF (a, y, 2). P 
Writing ¢, f or POL? o etc, the Fie. 17 
necessary conditions re an extreme, viz., $, = ¢, = $, = 0, give 
a aL ve 25 San 
[2] Maes NF, 
= Z fe ae z XP. 


Let the direction-cosines of the lines PA and PF, be l,, m,, n, and 
1,, My, Ny respectively ; and let /, m, n denote the direction-cosines 
of the normal to the surface [1] at the point P. Furthermore, 
since F’,, F,, F, are proportional to /, m, n, write 

AF, =kl, AFy=khm, XF,= kn. 


Equations [2] then become 


on C= hi, 
[3] M+ m,= km, 
pen Ny = kn. 


Designate the angle between PA and PF by (1, 2); between PF 
and the normal by (1, 2); between P& and the normal by (2, 7). 
It is seen then that 

cos (1, 2)=1,l,+ mymg+ 2%, 

cos(1, n)= 1,1 + mym+ mn, 

cos (2, 2) = 1,1 + mgm + Non. 


128 THEORY OF MAXIMA AND MINIMA 


Multiplying equations [3] by J,, m4, m4, respectively, and adding, 
it follows, since 1? + m?+2?= 1, that 


[4] 1+ cos(1, 2)=cos(1, n). 


Similarly, by multiplying equations [3] by /,, mg, Ng, respec- 
tively, and adding, we get 


[5] cos(1, 2) +1=cos(2, n). 
From [4] and [5] we have 

cos(1, n)= cos(2, 7), or 
[6] (1, n)=(2, 2). 


Moreover, upon multiplying equations [3] by /, m, n, respectively, 
and adding, we get 


cos(1, n)+ cos (2, n) = &, or,.from [6], 
p= 2 cos(1, 7). 


Substituting this value of & in [4], we have 


1+ cos(1, 2) = 2 cos?(1, n), 
so that cos(1,'2) = 2 cos#(1, n) —1= cos 2(1, n)= cos 2(2, n). 
It follows that (1, 2)= 244, n)= 2(2;-n), 


and that the lines PA, PA, and the normal must lie in the same 
plane, and it is further seen that the normal bisects the angle 
between PF and P£. 

We have thus arrived at the condition which is an optical law: 
The incident and reflected rays must lie in a normal plane, and 
the angle of incidence must be equal to the angle of reflection. 

The above result is merely a necessary condition for an extreme; 
to find whether an extreme really exists, and if it does, whether 
it is @ maximum or a minimum, let us choose the plane BPE as 
the «y-plane. 

If the curve cut from the surface by the plane EPR has 
the equation 


[7] y=f(*), 


oo 
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the problem now becomes to determine the nature of the point 
P which makes ve Male 0, where 

V@)= At dg=V (a1 — P+ (yy — YP (lg — 2)*+ (Ya— 9), 
the y being replaced by /(z). 


Now 2% @ = +ty—w)y' , @=m)+y—H)y 
ax d, 3 


while the equation of the normal to the curve [7] at P(z, y) is 
(7 — E)+(y— 7) ¥p= 0, 


and the distance of the point (;, y;) from this normal is 


jp, — Co MEY = WYP, 
Vi+ yf? 


Further, take the origin at the point P and the tangent 
to the surface at P lying in the plane AP£# as the z-axis. 


Then gol = 0 shows that 
dz 


and as f, and h, have opposite signs, since A and & lie upon 
opposite sides of aie normal, 


sin (1, n) = sin(2, n), 


or (1, n)=(2, n), 
as stated before in [6]. 
Note that 
yi 
dy[1+ y?@+y—y)y¥"I—- > [@—-%)+¥-")y'P 
yp ey a = 49 | dy [( 1 1 
dat d? 


1 

dg[1+ y+ (y— ya) y"]— 5 [le — %) + ¥—Ha) ¥'P 

Sa aaainie GREASE 
2 
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It follows that for the origin and the direction y'= 0, 


" 
d,(1— ny") 4 d,(1— yay") — z 
dy 2 
———————— + —————_ A « 
dat d? 
Writing @=(1, »)={2) a); 
we note that 41 — 7 — cos @, 2 
1 2 ; 
= a = “2 == sit @, | 
d, dy 
2 
so that a =(7 + a cos?@—2y"cos@. = Fie. 18 


From this it is seen that 


=9 
dx? 2\d, 


Since y'/= 0, we note that y" is the curvature of curve y=/(z) at the 


ss infra 
oo = 0 according as y= le == 7) 0. 
2 


origin, that is, y!’ = ; » where p is the radius of curvature. Hence, 
when 


ay es! 
=<=(—+— 6, 
UP) P al es i) 
dBA ini ; wh 
Tx? >0, and the path is a minimum; when 
12 / oe 
-~>=(—+— 6, 
7 pala ta) 
dp : 
dt <0, and the path is a maximum. 
To interpret this result geometrically it is seen that 
1 gs ea 
=(—+— 0 
laa) 


is the curvature of the ellipse whose foci are at F and B and which 
passes through P (see Pascal, Repertoriwm der Héheren Mathematik, 
Vol. IT, 1, p. 245). The quantities d, and d, are its focal radii 
at P, and @ is the angle between either focal radius and the normal 
to the ellipse at P. Note that this ellipse is tangent to the curve 
[7], since the normal to the curve bisects the angle between the 
focal radii of the ellipse and hence is also the normal to the ellipse. 
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When s= Aa + zoos 0, the ellipse and the curve [7] have 
the same ire ait at P, and the test for extremes is inconclusive. 
But here the conditions for a maximum or a minimum are obvious 
from geometrical considerations. For, remembering that d,+d, 
is constant for points on the ellipse, say d,+d,=h, then d,+d,<k 
for points within the ellipse and d,+d, > for points without the 
ellipse. Hence the path of the ray will be a maximum or a mini- 
mum according as the curve [7] lies within or without the ellipse 
in the neighborhood of the point P ; and it is seen that [9] and [8] 
are but special cases of this general condition. 


PROBLEM II. Refraction at the surface F (a, y, z)=O0. Using 
the previous notation, tt 1s required to find the conditions that make 


i ae 
the time of passage from E to B, that is, ++ —, an extreme, where 
1 2 
v, and v, represent the velocity of light in the two media. 


The Lagrangian function is (§ 89) 


did 
$(#, Y, => +A +rAFP(H, y, 2)- 
By Sy 

Proceeding as in the case of reflection, we find in place of 
equations [3] above 


[1] 

La 
From these equations we deduce that Fic, 19 
[2] Te eat econ, n), 

OF Ve 
[3] cee Es) = es (20), 

UT 3 

[4] cos(1, 2) ot cos(2, 2) __ k. 


al Vo 
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1 ; 
Multiplying [2] by and [3] by a and subtracting, we have 
| 2 
eS 3 ja nm) _ cos(2, |; 
vp op a 9 
substituting from [4] the value of & in this equation, it is seen that 


1 1 cos*(1,n) _cos?(2, ) 
rT ars vo} 
sin?(1, ”) _ sin?(2, 1) 
or SS ee 
% be 
It follows that 
[5] 


sin(1,) _ sin(2, 2) _ 
Ces 


From [2] and [4] it is seen that 


fe 4 cos (1, 2) pe cos*(1, 2) i cos (1, ae (2, m) 


= 3 a - 
Ae sin?(1, ») _ cos(1, m)cos(2, m)—cos(1, 2) 
tee i] 


Dividing this equation by [5] and then multiplying the result by 
sin (2, 2), we find 
sin (1, n)sin (2, ») = cos(1, n) cos (2, n) — cos(1, 2), 
or cos (1, 2) = cos[(1, n)+(2, n)], 
and therefore 
[6] (1, 2)=(1, n)+(2, n), 
so that the incident and the refracted ray lie in a normal plane. 
Equation [5] may be put in the form ¢ 


sin(1, 2) % 
sin(2, 7”) % 


’ 


where ¢ is the index of refraction of the second medium with 
respect to the first medium. The above is a generalization of a 
problem due to Fermat. 


~S ———— SS 
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The geometrical criteria for a maximum or a minimum involves 
a certain Cartesian Oval whose foci are at B and B and which 
passes through P. Its equation in bipolar codrdinates is 

U_@, + 01d, = const., 
d, and d, being the variable radii vectores. For points on this oval 
1+ — isa constant, say k; for points within this oval aa + en 
ple oe, dard ss 
and for points without this oval ++ —>k. 
"% 

Hence the time occupied by the ray in passing from F to P is a 
maximum or a minimum according as the curve cut from the sur- 
face by the normal plane through — and £& lies within or without 
this Cartesian Oval in the neighborhood of the point P. 


CHAPTER VII 
SPECIAL CASES 


I THE PRACTICAL APPLICATION OF THE CRITERIA THAT 

HAVE BEEN HITHERTO GIVEN AND A METHOD FOUNDED 

UPON THE THEORY OF FUNCTIONS, WHICH OFTEN 
RENDERS UNNECESSARY THESE CRITERIA 


93. The practical application of the established criteria is in 
many cases connected with very great, if not insurmountable 
difficulties, which, however, cannot be disregarded in the theory. 
For often the solutions of the equations § 89, [c], cannot be 
effected without great labor, if at all, and therefore also the forma- 
tion of the function ¢ is impossible. It also happens, even if the 
function ¢ can be formed, that the discussion regarding the coeffi- 
cients of Ae = 0 is attended with much difficulty. Moreover, the 
formation of the function ¢ and the investigation relative to the 
coefficients of Ae are very often unnecessary, since through direct 
observation we may in many cases determine whether a maximum 
or a minimum really exists. If it then happens that the equations 
[c] admit of only one real solution (ie. of a real system of values 
_ @, Ly,++ +, L,), we May be sure that this is in reality the maximum 
or the minimum of the function. In the same way, if we can con- 
vince ourselves a priori that both a maximum and a minimum 
exist, and if it happens that the equations [c] offer only two real 
systems of values, it is evident that the one system must corre- 
spond to the maximum value of the function, the other system 
to the minimum value. 

The determination as to which of the two systems of values 
gives the one or the other is in most cases easily determined. 

One cannot be too careful in the investigation whether on a 


position which has been determined from the equations [a] and 
184 


“ 
5 
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[c] of § 89 there really is a maximum or a minimum, since there 
are cases in which one may convince himself of the existence of 
a maximum or a minimum, when in reality there is no maximum 
or minimum. 

For example, to establish Euclid’s theorem respecting parallel 
lines, one tries to prove the theorem regarding the sum of the 
angles of a triangle without the help of the theorem of the parallel 
lines. Legendre was able, indeed, to show that this sum could not 
be greater than two right angles; however, he did not show that 
they could not be less than two right angles. The method of 
reasoning employed at that time was as follows: If in a triangle 
the sum of the three angles cannot be greater than 180°, then 
there must be a triangle for which the maximum of the sum of 
these angles is really reached. Assuming this to be correct, it 
may be shown that in this triangle the sum of the angles is equal 
to 180°, and from this it may be proved that the same is true of 
all triangles. 

We see at once that a fallacy has been made. For if we apply 
the same conclusions to the spherical triangles, in the case of 
which the sum of the angles cannot be smaller then 180°, we 
would find that in every spherical triangle the sum of the angles 
is equal to 180°, which is not true. 

The ‘fallacy consists in the assumption of the emzstence of a 
maximum or a minimum; it is not always necessary that an 
upper or a lower limit be reached, even if one can come just as 
near to it as is wished (see § 8). 

On this account the assumption of the existence of a real maxi- 
mum is not allowed without further proof. We therefore endeavor 
to give the existence-proof. For this purpose we must recall 
several theorems in the theory of functions.* 

94. We call the collectivity of all systems of values which x 
variable quantities x,, %,,---+,%, can assume the realm (Gebret) 
of these quantities, and each single system of values a position 
in this realm. If these quantities are variables without restric- 
tion, so that each of them can go from — to + 0, we call the 


* Note especially § 137. 
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realm considered as a whole (Gesamtgebiet) an n-ple multiplicity 
(n-fache Mannigfaltigkeit). If a, %,-+++, % are independent of 
one another, then we say a definite position (a1, a, -+-, Gp) lies 
on the interior of the realm if these positions and also all their 
neighboring positions belong to this region; it lies wpon the 
boundary of the realm if in each neighborhood as small as we 
wish of this position there are present positions which belong 
to the realm, and also those that do not belong to it; it lies 
finally without the defined realm if in no neighborhood as small 
as we wish of this position there are positions which belong to 
the defined region. 

If the quantities 7,, %,---, “, are subjected to m equations 
of condition, then we may express these in terms of  — m inde- 
pendent variables u,, Ug,+++; U,—m, and the same definition may 
be applied to these variables. 

95. The following theorems are proved in the theory of func- 
tions: (1)* If a continuous variable quantity is defined in any 
manner, this quantity has an wpper and a lower limit; that is, 
there is a definitely determined quantity g of such a kind that 
no value of the variable can be greater than g, although there 
is a value of the variable which can come as near to g as we 
wish. In the same way there is a quite determined quantity & of 
such a nature that no value of the variable is less than £, although 
there is a value of the variable that comes as near to k as we wish 
(see also § 8). 

(2)t In the region of n variables x,, 2 ,+++, Z,, suppose we 
have an infinite number of positions defined in any manner; 
let these be denoted by (x, «),---, x1). Furthermore, suppose that 
among the positions we have such positions that z!, can come 
as near to a fixed limit a, as we wish. Then we have in the 
region of the quantities x, w,-+--, %, always at least one definite 
position (a, @,-++-+, @,) of such a nature that among the definite 
positions (x, #,+--, ),) there are always present positions that 


* Dini, Theorie der Functionen, p. 68. See also a paper by Stolz, ‘‘B. Bolzano’s 
Bedeutung in der Geschichte der Infinitesimal Rechnung,’’ Math. Ann., Vol. XVIII. 
} Biermann, Theorie der An. Funk., p. 81; Serret, Cale. diff. et int., p. 26. 
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lie as near this position as we wish; so that, therefore, if 5 denotes 
a quantity arbitrarily small, 


|y~—a@|<d (A=1, 2,---, n). 


This position lies either within or upon the boundary of the 


defined region («}, x},-++, a). 
96. This presupposed, let us consider a continuous function 
F'(#,, %, +++, pm), and let the realm of the quantities 7,, x, ---, x, 


be a limited one, so that, therefore, we have systems of values 
which do not belong to it. If for every possible system of values 
(1, Lg,°*+, %) We associate the corresponding value of the 
function, which may be denoted by x,,,, then we have defined 
certain positions in the region of +1 quantities. For the 
quantity ~,,, there is according to the first 
theorem an upper limit a, ,,; consequently, 
owing to the second theorem there must be 
within the interior or upon the limits of the 
defined region a position (41, dy, +++, Gyn, Gn 41) 
of such a nature that in the neighborhood of O % a Wy 
this position there certainly exist positions fo Ds 
which belong to the region in question. ae 

Now if it can be shown that this position lies within the interior 
of the region, then there is in reality a maximum of the function 
on the position (@,, +++, @,); on the con- 
trary, if the position lies on the boundary, 
we cannot come to a conclusion regarding 
the existence of a maximum of the func- 


The case of a maximum 


The case of a minimum 


tion %y44- 
" Oma a Ho 
r cases happen that. one zs 
It may in many € PP ne Heya aereD 
can. show, if (x1; Hy, * * *% Lp) 1S any position Fig. 21 


on the boundary of the realm and if 2,4, 

denotes the corresponding value of the function, that there are 
present within the realm positions for which the values of the 
function are greater than for every position on the boundary. Then 
the position which we are considering here cannot lie upon the 
boundary, and it is clear that the limiting value of the function 


138 THEORY OF MAXIMA AND MINIMA 


can be assumed for a definite position within the interior, since 
the function varies in a continuous manner., The analogue is, of 
course, true for a minimum. If, however, 
it does not admit of proof that there are 
positions on the interior of the defined 
realm for which the value of the function 
is greater or smaller than it is for all 
positions on the boundary, then nothing 
can be concluded regarding the real exist- Oi ae 
ence of a maximum or a minimum; the DES ae 
position (a, a, +--+, @,) would then lie shag 
on the boundary of the region, and there might be an asymptotic 
approach to the limiting value a, ,, without this value’s being in 
reality reached. Such cases need especial attention. 

The figures give a plain picture of what —_yyarimum on the limiting 
has been said for the case y= f(x), where « ORIN a 
is limited to the interval (2, ---«---+ %,). 

97. Analogous considerations of the 
above are fundamental in the very defi- 
nition of an analytic function. For con- 
sider a power-series of x assumed or given Oo) erie He 
in any manner ; let a! be a definite value [aa fGe rs) 
of «. Then there are three possibilities : pees 

(1) a may lie in the region of convergence of the given series 


Case of asymptotic approach 


or of a series that is derived (§ 138) from the given series ; the value . 


for «=x! of this series is a value of the analytic function which 
is determined through the original series. In other words, if with 
Weierstrass we call the original series as well as any other series 
derived from this one with regard to the function which it repre- 
sents a function-element (Functionenelement), then the first sossi- 
bility consists in that, if any function-element is given, the definite 
value x’ lies in the region of convergence of a function-element 
which is derived from the given one. We admit here also the 
complex variable. 

(2) It may happen that x! does not lie in the region of con- 
vergence of any series that has been derived in this manner and 
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that we cannot derive from the original function-element another 
Sunction-element whose region of convergence can come as near to 
the point x' as we wish. In this case the function does not exist 
fora = i270 « 

(3) Although we cannot find a power-series within which «x! lies, 
nevertheless, it sometimes happens .that we may still derive elements 
whose regions of convergence contain positions which can come as 
near to the point x! as we wish. Whether we can then define the 
function for z=! by the consideration of boundary conditions 
must in each case be considered for itself. 

If we have case (1) before us, then the function is defined 
not only for every value x’ but also for all values in the neigh- 
borhood of «' and has for these values the character of an integral 
function. 

The definition of an analytic function as thus given is prefer- 
able to other definitions from the fact that the existence of 
general analytic functions is at once recognized; in short, that 
we have under our control, in our possession, all possible analytic 
functions. Every possible power-series within a region of con- 
vergence gives rise to the existence of a definite analytic function. 
Moreover, one must assume the duty of proving in the case of 
every example that it leads to just such functions. 

For this reason investigations are necessary of which formerly 
we find no trace. If we have a differential equation, we must 
. begin with the proof that the functions which satisfy the differ- 
ential equation arise from such function-elements as we have just 
explained; that is, we must first show, if y is the unknown func- 
tion and « is the variable of the differential equation, that this 
equation can be satisfied through y=P(«—a). Reciprocally, if 
any variable quantity y is so connected with another variable 
quantity x that it satisfies the differential equation, we must 
show that it may be derived from one single function-element in 
the manner indicated. This last proof is of especial importance 
in the application of analysis to geometrical mechanics. 

When a problem is given in mechanics, we have to represent 
the coordinates of the moving point as functions of the time. 
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Only real values are permitted in this problem. We cannot 
therefore a priori know whether the required functions are 
analytic or not. : 

-These functions are generally defined through differential equa- 
tions. We shall give the simplest case as an example. Suppose 
we have a system of points that attract one another according 
to an analytic law, and let 2,, %,--+-, Z, be the codrdinates of 
these points. If the motion is a free one, we have the differential 


equation in the form 


rb i 
Seg ee ie la, 22% 5 Bn), 
where F denotes a given function of %,, %,--+, £,. With such 


a problem we have to prove before everything else that the 
required functions of time are analytic functions. If for the 
point ¢=¢, the initial position and the initial velocity are given, 
then in the neighborhood of the initial position we can find 
power-series, and we have to show that through these power- 
series the required functions are completely determined. 


II. EXAMPLES OF IMPROPER EXTREMES WHERE THE DIF- 

FERENCE F(a, + hy d+ hoy ++) Ant hin) — F (ayy Gg) - ++) dn) IS NEITHER 

POSITIVE NOR NEGATIVE BUT ZERO ON THE POSITION 
(a4, dy)+++, d,) WHICH IS TO BE INVESTIGATED 


98. We shall now consider a case which is not included in the 
previous investigations, but may be in a certain measure reduced 
to them: The definition of the proper extremes of a function 
consists in the fact that the “difference 


F(a, + hy, agatha, +++, dy + hy) — F(a, dg, +++, Op) (2) 


must be invariably negative or invariably positive. There are cases 
where an extreme does not appear on the position (a1, d,-- +, &,) 
in the sense that the above difference must be positive or nega- 
tive, but in the sense that the difference must be zero. 

Suppose, for example, we have the problem: Determine a 
polygon of 2 sides with a given constant perimeter S whose area 
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is a maximum,—a problem which we shall later discuss more 
fully (see § 101). 

If this maximum is attained for a definite polygon, then we 
may at pleasure change the system of codrdinates by sliding the 
polygon in the plane without altering the area. 

For example, let n = 3, and (2, 9;), (#2, Ya), and (2g, yz) be the 
coordinates of the vertices of the triangle. Then the expression 
which is to be a maximum is 


P= ¥(XYq— Uy + Lag — L3Yq+ Ley — L,Yp), 
where the variables are subjected to the condition 
S=V (ty— a)?+ (YoY)? +V (a3 — £9)? + (Y3— Yo)" 
5 V(x — &3)°+ (Yy— Ys)”. 


There will not only be one system of values which gives for 
Fa maximum value, but an infinite number of such positions; 
since, if we take the triangle in a definite position, we may move 
it in its plane at pleasure. This is therefore a case where the 
difference (z) is not positive or negative but zero. 

99. Such cases, however, may be reduced to maxima and 
minima proper if we choose arbitrarily some of the variable 
- quantities. In the special example of the preceding section we 
may assume a vertex of the triangle at pleasure; let it be the 
origin of codrdinates, and we further assume that one of the 
sides coincides with the positive direction of the Y-axis, so 
that. we may write z,= y,=¥,=0. If we agree that the triangle 
is to lie above or below the X-axis, the problem is completely 
determinate. 

In so far as the necessary conditions for the existence of an 
extreme are concerned we may proceed in precisely the same 
manner as we have hitherto done, since under the assumption 
that there are no equations of condition we have 


=P hePral yy gy +5 Bp) (ley Igy «+ 45 hy) ®. (ii) 
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If a minimum is to be present, then this difference can never be 
negative, but may be zero. For this to be possible the first deriv- 


a@=n 
atives must all vanish. Since, if the sum > teFa CAR Pens) 
had (say) a positive value for hy=¢,, hg=¢g, a -, Ay = Cy, then we 
could place h, equal to c,2 and then choose / so small that the 
sign of the right-hand side of (i) would depend only upon the 
sign of the first term. If we then make / positive or negative 
the difference would also be positive or negative. 

If equations of condition are present, it may be shown, as 
above, that the derivatives of the first order must vanish, since, 
if all these derivatives did not vanish, we might express some 
of the #’s through the remaining ones, and then proceed as we 
have just done. The required systems of values (7, %,+++, Zp) 
will therefore be determined from the same equations as before. 

100. If we have found a system of values of the 2s which 
satisfy the equations of condition of the problem, then in the 
neighborhood of this position there will be an infinite number 
of other positions which satisfy the equations. These last are 
characterized by the condition that the difference (7) vanishes 
identically for them. 

This is just the condition that made impossible the former 
criteria, by means of which we could decide whether an extreme 
really entered on a position (a1, @,--++, a,) that was determined 
through the equations in 2, %,+++, ap. 

One must therefore seek in another manner to convince him- 
self which case is the one in question. 

This is further discussed in’ the following problem: 

101. PRoBLEM. Among all polygons which have a given number 
of sides and a given perimeter, find the one which contains the 
greatest surface-area. (Zenodorus.) 

We see at once that the problem proposed here is of a some- 
what different nature from the problems of §$ 90 and 91, since 
the existence of the maximum value of the function is no longer 
the question, as was proposed in § 49 and held as fixed through- 
out the general discussions. For if the definition of the maximum 


EE ———— rl 
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is such that the function on the position (a,, a,,++-+, a,) must have 
a greater value on this position than on all neighboring positions, 
then in this sense our polygon could certainly not have a maximum 
area; since, if we had such a polygon on any position, we might 
slide the polygon at pleasure without changing its shape and con- 
sequently its area. Therefore only a maximum of the area can 
enter, in the sense that the periphery remaining the same an in- 
crease in the area of the surface cannot enter for an indefinitely 
small sliding of the end-points. We consequently cannot apply 
our general theory without further restriction. 
102. Let the coordinates of the n end-points taken in a definite 
order be Lis V3 Las Yash Gur Ye 
The double area of a triangle which has the origin as one of its 
vertices and the coordinates of the other two vertices «,, y, and > 
Lo, Yo 18, neglecting the sign, determined through the expression 
EEE Ee 

To determine the sign of this expression we suppose that the 
fundamental system of coordinates is brought through turning 
about its origin into such a position that the positive X-axis coin- 
cides with the length 01. We call that side of the line 01 posi- 
tive on which lies the positive direction of the Y-axis: The double 
area of the triangle 012 is to be counted positive or negative 
according as it lies on the positive or negative side of the line 01. 

If the point 0 has the codrdinates 2), y), the double area of 
the triangle is 

2 Ania = (1 — Xo) (Y2— Yo) — (Y1 — Yo) (Ha — Xo) 

where the above criterion with reference to the sign is to be applied. 

For the polygon we shall take a definite consecutive arrange- 
ment of the points (1, 2,---, m) and, besides, we shall assume that 
no two of the sides cross each other. The last hypothesis is 
justifiable, since we may easily convince ourselves that if two 
sides cut each other we may at once construct a polygon whose 
sides do not cut one another and which, having the same perim- 
eter as the first polygon, incloses a greater area. 
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Within the polygon take a point 0 =(zp, Y) and draw from it 
in any direction a straight line to infinity. This straight line 
always cuts an odd number of sides of the polygon. 

Now if we follow the periphery of the polygon in the fixed 
direction (1, 2,---, 7) and mark the intersection of a side by the 
straight line with + 1 or —1, according as we pass from the nega- 
tive to the positive side of that line or vice versa, then the sum of 
these marks is either +1 or—1. In the first case we say that the 
polygon has been described in the positive 
direction, in the second case in the nega- 
tive direction. 

It may be proved* that whatever point 
be taken as the point 0 within the poly- 
gon and in whatever direction the straight 
’ line be drawn, we always have the same 
characteristic number +1 or —1 if in each case the positive 
side of the straight line has been correctly determined. 

103. The double area of the polygon is 


5 2 


Fie, 24 


= (1 — 2) (Yo— Yo) — (La— Xp) (Y1— Yo) + (2— Xo) (Y3— Yo) 
— (®g—XLy) (Yo-Yo) + + + + +(Ln—%o) (Yy—Yo) — (LH — Xp) (Yn—Yo) 3 
or 2 FH LyYo—LqY1 + LoYg—LgYqt +++ +%pYy~—LyYn» (@) 


where the positive or negative sign is to be taken according as the 
polygon has been described in the positive or negative direction. 
We may, however, eventually bring it about through reverting 
the order of the sequence of the end-points that the expression 
2 F is always positive. 

104. Suppose that this has been done, The function 2 F is to 
be made a maximum under the condition that the periphery 
has a definite value S. 

We may write 


$a 8,9 *** +3,-1 nt Sn,p=—)) (8) 
where S-1aA= V(a— %p_1)® + (Yx— Yn—1). (Y) 


* The proof is found in Cremona, Elementi di geometria projetiva. Rome, 1873. 
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Form the function 


G= 2F+ (59+ 8,3++-+ +8,,1—S), (8) 
and placing its partial derivatives equal to 0, we have 

0G 2 eam oe 7 
gee naa e( Fatt 4 AO i) 0 

Hy SA+1,A §\-1,d 
0G _ Y,— «© 
— =— Bayt mate(Y Yrst 4 Yr thet) 

CY SALA SA —1, A 


(A=1, 2,--+-, 2; however, for A =n we must write \+1=1). 
Take in addition the equation (8) and we have 2” +1 equations 
for the determination of the 27+1 unknown quantities 

MOG ey tee 1) Yay" Uns Se 
105. To reach in the simplest manner the desired result 


from these equations, we adopt the following mode of procedure. 


t 
es te 2 = (%q— ®_y) + UY. — Yr-1)s (6) 


then z,, geometrically interpreted, represents the length from the 
point %—1 to the point » both in value and in direction. 
If, further, we write 


= (®— @_1)— TYa— Yr-1)s (n) 
then %* = S10 (9) 
Multiply the first of equations (e) by 7 and subtract from the result 
the second; then owing to (¢) we have 


“f __*X +1 
at aartei( == |= 0, 


Suet CA cal 


et 1e€ 
aie 5), r41 


ew eu 
fi jana OA 
A(1— J=—Aer(1 ) 
Siti A Sr,A41 


Now, multiplying the last two equations together, we have from (0) 


2 é 2 if é 
CR ale 1+ 2 SE eal ae Y 


2, 
SA—1,a Sr, A+1 


(4) 


and therefore $34, v= SF ate 
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106. Since s,_;, is an essentially positive quantity, it follows 
that S=1 A AA, (k) 
and consequently the sides of the polygon are all equal to one 
another. Hence each side = -, and we have from (¢) 


241, _ em+S 


=— = const. 
a ein — 
: Sie 
If we write hea ert, 


where ¢, denotes the angle which s,_,,, makes with the X-axis, 


then e(?a+1— %r)4— eonst., 


or $,41— $= const.; (A) 
that is, all the angles of the polygon are equal to one another, 
and consequently the polygon is a regular one. 

It is thus shown that the conditions which are had from the 
vanishing of the first derivatives can be satisfied only by a regular 
polygon ; that is, if there is a polygon which, with a given perim- 
eter and a prescribed number of sides, has a greatest area, this 
polygon is necessarily regular. 

Our deductions, however, have in no manner revealed that a 
maximum really exists. 

107. To establish the existence of a maximum we must apply 
the method given in §§ 93-96. We note that an upper limit 
exists for the area of the polygon, from the fact that the number 
of sides and the perimeter are given; for if we consider a square 
whose sides are greater than the given perimeter S, we can lay 
each polygon with the perimeter S in this square, and in such 
a way that the end-points of the polygon do not fall upon the 
sides of the square. Hence the area of the polygon cannot be 
greater than that of the square, and consequently there-must be 
an upper limit for this area, which may be denoted by Fy. The 
question is, Can this limit in reality be reached for a definite 
system of values? The variables 2,, ¥;3 2, Yo} +++} Ln» Yn being 
limited to this square, there must be (§ 96) among the positions 
(@, Yy3 Lg, Yogi +++} Lmy Yn) Which fill out the square a position 


C—O EEE eer eee ee ee 
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(4, 0,3 Gy, bg; +++; ad, b,) of such a nature that in every 
neighborhood of this position cther positions exist for which the 
corresponding surface area F of the polygon formed from them 
comes as near as we wish to the upper limit. We may assume 
that this position is within the square, since if it lies by chance 
on the boundary, then from what has been said above, it is 
admissible to slide the corresponding polygon without altering 
its shape and area into the interior of the square. 

We assert that the value of the function / for the position 
(21, 613 yg, bg; ++ +3 G, 6,) must necessarily be equal to F,. For 
if this was not the case, the inequality must also remain if we 
subject the points a, b,; dg, 0,; +++; a, b, to an indefinitely 
small variation; and on account of the continuity of F it would 
not be possible in the arbitrary neighborhood of (a1, 0,3; +++; an, ,) 
to give positions for which the corresponding area comes arbitrarily 
near the upper limit /. This, however, contradicts the conclu- 
sions previously made. Hence all n corners with a given periph- 
ery not only approach a definite limit with respect to their 
inclosed area but this limit is in reality reached. Since, furthermore, 
the necessary conditions for the existence of a maximum have 
given the regular polygon of m sides as the only solution, and 
since we have seen a maximum really exists, we may with all 
rigor make the conclusion: That polygon which, with a given 
periphery and a given number of sides, contains the greatest area 
is the regular polygon. 

PROBLEM 


Among the regular polygons with a constant periphery, the one with 
the greatest number of angles has the greatest area. (Zenodorus.) 

108. Hadamard’s problem. If 4, =(#,, 4%, 2), 4g=(os Yor %); 
Ag= (3, Y3, 23) are the rectangular coordinates of any three points 
from a fixed origin O, the volume formed on the three lines 04), 
OA,, OAs i8 RW ent 
A=/%, Yo. %|5 

X3, Ya, % 


and if oi+y2+2e2—d? (i=1, 2, 3), 
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where d,, d,, d, are positive constants, it may be easily shown 
that A is a maximum when A=d, -d,-d,; or, of all parallelo- 
pipedons constructed on the three sides OA,, OAy, OA, the one 
having the greatest volume is the rectangular parallelopipedon. As 
the parallelopipedon may occupy an infinite number of positions 
without changing the origin, we have here a case of improper 
maximum which is of interest. 
The extension of this problem is due to Hadamard.* 


: r pane 
Mise “4ia: eae 


ie Z, Pie he 
eee 22) ? 2n | 
If A “nl . . . . . . . 


Enq» Eno» an ne & 


where ai +ae+.-.-+a2 —d?(i=1, 2,---, ns the d’s being 
positive constants, show that the maximum of the absolute value 
of A ts A=d,-d,- --- +4 


This may be done as follows: 
Let the determinant be developed with respect to the elements 
of the 7th line, so that 
A= Aj 181+ Ajokjigt +++ +AinZin- (2) 
We then have to find the maximum or the minimum of the func- 


tion A of the n variables #;1, %9,+++, Z%, Which are connected 
by the relation 


w+ D+ +++ + ah a (w) 
The Lagrange method (§ 89) leads at once to the conditions 
TI Le Lin = 
re aU 
Aiy Aj» Ain ( ) 


Tf 21, 2pq,+++, Len are the elements of another line of the 
determinant, we have 


Aj 1% + Ajolpat > ++ + Ain®en = 0 5 (wv) 
or, from (22), @j12py + VjoXpgt +++ HLin%pn= 0, (v) 
where 4+ k. 


* Hadamard (Bull. des Sciences Mathématiques, Second Series, Vol. XVII, 1893). 
Proof by Wirtinger (ibid., 1908). An interesting application of this problem is found 
in Bocher, Introduction to the Study of Integral Equations, pp. 31 et seq. 


ia 


OO ee eS Ss ee 


a 


SPECIAL CASES 149 


From this we conclude that the determinant can only have an 
extreme value when it is orthogonal. 

When the conditions (v) exist, the square of the determinant 
is another determinant, in which all the elements are zero except 
those of the principal diagonal, which are d?, d3?,..., d?. 

It follows that A?= d?. d?,..-, a. 


Here again we have an improper extreme which it is interesting 
to consider further. 


II. CASES IN ‘WHICH THE SUBSIDIARY CONDITIONS ARE 
NOT TO BE REGARDED AS EQUATIONS BUT AS LIMITATIONS 


109. Besides the problems already mentioned, those problems 
are particularly deserving of notice in which the conditions for 
the variables are not given in the form of equations but as 
restrictions or limitations. 

For example, let a point in space and a function which depends 
upon the coordinates of this point be given. Furthermore, let the 
point be so restricted that it always remains within the interior of 
an ellipsoid ; then the restriction made a the point is expressed 


through the inequality pe 


0254845 La 


We have, accordingly, such limitations when a function of the 
variables is given which. cannot exceed a certain upper and a 
certain lower limit. 

We make such a restriction when we assume that a function 
J, shall always le between fixed limits a and 0. 

110. This limitation, which consists of two inequalities 

[a] @ <7, 0; 
may be easily reduced to one. 

For from [a] it follows necessarily that 


[6] Jee ye 


and, reciprocally, if [8] exists and if a< 0, then f, must be 
situated between a and b and consequently [a] must be true. 
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Every limitation of the kind given may be analytically repre- 
sented as one single inequality of the form [A]. 

111. We must next find the algorithm for the cases under 
consideration. This may be done at once if we consider that 
such cases may be reduced to those in which occur equations of 
condition. For this purpose we need only establish the problem 
of finding the maximal or minimal values of a function whose 
variables are subjected to certain conditions as follows: 

It is required among all systems of values which satisfy the 
equations f,_,(N=1, 2,-++, m) to find those for which F is a 
maximum or a minimum. 


By proposing the problem in this manner, it is clear that all 
the variables « which appear in the equations of condition need 
not necessarily be contained in the function. 

Suppose further we have the limitation that 


Ly] Jee Ue 


then, through the introduction of a new variable z,,,;, we may 
transform this limitation-into an equation of condition. For, as 
we have to do with only real values of the variables, the equation, 


[y7] Ji: = od 
denotes exactly the same thing as [y]. 

If, therefore, a function F'(a,, Lo, +++, Z) 1s to be a maximum 
or minimum under the limitations 


A=Oh,= 0, ee > Jan= 0, Jas = ieee. es ee Payee. 


where the /’s are functions of #,, 7,---, Z,, then we may solve 
this problem if instead of the 7 last restrictions we introduce the 
following limitations : 


ae oe 2 
Tet = t+ Sm+2= Un+99°% JSnee= Trtr 


The problem is thus reduced to the one of finding among the 
systems of variables 2,, %,-++, %,, those systems for which F 
is @ Maximum or a minimum. 

112. Examples of this character occur very frequently in 
mechanics. As an example consider a pendulum which consists 


% 
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of a flexible thread that cannot be stretched. The condition under 
which the motion takes place is not that the material point remains 
at a constant distance from the origin, but that the distance can- 
not be greater than the length of the thread. Such problems are 
more closely considered in the sequel. It will be seen that by 
means of Gauss’s principle all problems of mechanics may be reduced 
to problems of maxima and minima. 


IV. GAUSS’S PRINCIPLE 


113. For the sake of what follows we shall give a short ac- 
count of this principle: Consider the motion of a system of points 
whose masses are 71, M,++-, M». Let the motions of the points 
be limited or restricted in any manner, and suppose that the system 
moves under the influence of forces that act continuously. For a 
definite time let the positions of the points and the components 
of velocity both in direction and magnitude be determined. The 
manner in which the motion takes place from this period on is 
determined through Gauss’s principle: 

Let A,, A,,---, A, be the positions of the points at the moment 
first considered; B,, B,,---, B, the positions which the points 
can take after the lapse of an indefinitely small time 7, if the 
motions of these points are free; Cj, C,,---, C, the positions in 
which these points really are after the lapse of the same time 7; 
and, finally, let C{, Ch, +++, CG, be the positions which the points 
may also possibly have assumed after the time 7, when the 
conditions are fulfilled. 

If we form 


v= 


a) v=Nn a 
m,B,C, d BO nw, 
Pas an 2a 


it follows from Gauss’s principle that from 7=0 up to a definite 
value of t the condition 


v=? v= 


P| a7. B, B,C << Dae B, BC 


v= 


is always satisfied ; that is, ym B, age, must always be a minimum. 
v=1 


ra 
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114. To make rigorous deductions from Gauss’s principle, which 
was briefly sketched in the preceding section, we shall give a more 
analytic formulation of it: For this purpose we denote the coordi- 
nates of A, by 2,, y,, %,, the components of the velocity of A, by 
a, y,, 2, and the components of the force acting upon A, by 
X,, Y,, Z, The coordinates of B, are therefore 


2 2 = 
tty + 12y + Ky wtytSY,, zt +54, 


and from Taylor’s theorem the coordinates of C, are 


2 ees " 
Y, fees, 7 i et ica eet =f = + 95 


reeusiae, ue 
L+ATH, +—H, +--+, YtTY, + ; 


2 2 


consequently we have 
———)) rt 
2 B= XP +(y)'—V,)P? +(e) —Z,)} — +. +s 
[2] DmB,C, =D ym {(@'— XP +(ys' VP +@'-Z)3 7+ 


Instead of x,’, however (see preceding section), other values may 
possibly enter, say x,/+ &,-+-, so that we have 


v=n 


[3] >" B,o. Ps ig += xP 
+ (yi +4, =X.) Fie oe ae ae - 
It follows from Gauss’s principle that the difference of the sums 
[2] and [3] must always be positive. 
Hence 


[4] 0>Sm, $2 [E,(a,)'-X,) + n (y nee” 
+E tne RT ms 


that is, the quantities x’, y/’, z/’ must be such fo the sum [2] 
is a minimum. 

Hence, among all the a’, y/’, z/’ which are associated with 
the conditions of motion, we must seek those which make [2] 
a minimum. 

115. We have reached our proposed object if we can show 


that the ordinary equations of mechanics may be derived from 
Gauss’s principle. 
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If there are no equations of condition present, then clearly [2] 

is only a minimum when 
fy UE hn ea 
If, however, we have equations of condition, for example, if any 
of the variables are connected by a relation such as f (x, y, z)=0, 
then these must hold true throughout the whole motion. They 
may therefore be differentiated. We have in this way equations 


“, se *, and a. Differentiate again and we have equations 
in a’, y}', and 2’. 

Hence, in conformity with the rules that have been hitherto 
found for the theory of maxima and minima, the quantities 
a,', y|', 2[' are to be so determined that the derived equations 
of condition are satisfied, while at the same time [2] becomes a 
minimum. But in this case also, as is easily shown, we are led 
to the usual differential equations of mechanics. 

116. The theory of maxima and minima may be applied 
to realms which are seemingly distant from it. An example 
in question is the proof of a very important theorem in the 


theory of functions. 


THE REVERSION OF SERIES 


If the following n equations exist among the variables x, 


La,°**%) Ens Yys Yor* +> Yw 


Yq = yy % + Ayy% + +++ + Unt t+ X> 
Yo = F% Ais Agathe =F exe, fe Hanky >. 
i em ee okey cas ae 
(on = On 1% + On gila + oe Bnnkn +X,» 


where the coefficients on the right-hand side are given jinite quan- 
tities and the X’s are power-serves in the xs of such a nature 
that each single term is higher than the first dimension, and 
if the series on the right-hand side are convergent and the 
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determinant of the nth order of the linear functions of the 
as which appear in [1], namely, 


Hy, M2, +**, Un 

s;, ‘Gone 
[2] A=| 72) 22) » “On ; 

Ona, Ina.°**s Inn 


is different from zero, then, reciprocally, the xs may also be 
expressed through convergent series of the n quantities y which 
identically satisfy the equations [1]. 
117. As an algorithm for the representation of the series for 
the «’s, we make use of the following methods (cf. §§ 135, 136): 
If we solve the equations [1] linearly by bringing the terms of 
the higher powers of the 2’s on the left-hand side, we have 


A es A n 
Bo i Nala Koo ee 


A, An 87 Ae 
=F Wav te Ware) ee ee 


where 4,, denotes the corresponding first-minor of @,, in [2]. 
It is seen that in general 


iL 


A ei ee Ly, Lo, +++, 2, ]O 
[3] = Dog Wer ea) De rE vce ¥ = Ik ? 
= 


= 


where [#, %,--+-+, #,]® denotes terms of the second and higher 
dimensions in 2, %»,+++, Lp. 

We shall therefore have a first approximation to the result if 
we consider only the terms on the right-hand side of [3] which 
are of the first dimension. A second approximation is reached 
if we substitute in the right-hand side of [3] the first approxima- 
tions already found and reduce everything to terms of the second 
dimension inclusive. Continuing with the second approximations 
that have been found, substitute them in [3] and, neglecting all 
terms above the third dimensions, we have the third approxi- 
mation, etc.; we may thus derive the 2’s to any degree of 
exactness required. . 
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Since 4 is found in all the denominators, the development 
converges the more rapidly the greater A is. 

118. In what follows we shall assume that the quantities on 
the right-hand side of [1] are all real and that we may write 


X= Fat st Hat --- A=1, 2 i ), 


where /,; is a homogeneous function of the ith degree in 2, 


%,°++, ,, and consequently by Euler’s theorem for homogeneous 
functions 


See shee AGERE 1 3H, 
SS 37 dc, 2°? da, es 
LHe a aor, 1 ys 
eg Lipp es trae Be” ba, 
REN Ve AEE ine OFA 
se ae er iar ey Chey ia at 

+ 


BE Xe. = od | X, aa ae Ne — ct en XGe 


where the quantities ee (Naa, 2, 2 5 5h pe Ly 2h lee) are 
continuous functions of the z’s, which become indefinitely small 
with the 2’s. 

The system of equations [1] may then be brought to the form 


L=n : 
[1a] => (Am +5) Ly (A = £ 2, eo: nN). 
w=] 


The theorem of § 116 in this modified form may be expressed 
as follows: 


(1) It is always possible so to fix for the variables ay, X, +++, Ly 
ANd Yx5 Yor +s Yn» WMUS Gy, Jars In and hy, hg, +++, hy that for 
every system of the y's for which |y,|<h, there exrsts* one 
system of the x's for which |a,|<g,, and in such a way that 
the equations [la] are satisfied. 


* See Biermann, Theo. der An., Funk., p. 234, and also Stolz, p. 172. 
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(2) The solution of the equations [1a] has a form similar to 
the equations [1a] themselves, viz. : 


H=N/ 4 
[3 a| nae +Y,,) Yu (X = A, 2, Mea) N), 
b= 


where the Yes are continuous functions of the y's, which become 
indefinitely small with these quantities. 


To prove this theorem we make use of the theory of maxima 
and minima. 

119. If we give to the y, the value zero, the equations [1a] 
can only be satisfied if their determinant vanishes, that is, when 


[4] [tm +X, |= 9 (hd, 2, - 60 pegigh = 1, 2)-- =; a), 


except for the case where the 2’s vanish. 

For sufficiently small values of the z’s the determinant [4] is 
not very different from the determinant [2]. We may therefore 
determine limits g for the as so that [4] cannot be zero unless 
[2] is also zero. A=0O is, however, by hypothesis excluded. 
Accordingly the y’s can only be zero in [1a] when all the z’s 
vanish, provided the «’s are confined within fixed limits. These 
limits may be regarded as the bowndaries of a definite realm. 

120. Again, we write 


w=n 
[5] Yr =>) (4 +X.) Ly =F, (x, Xo, ee Ly) 
=1 
4 (A = 12, = <=, 2); 


and consider the function 


A=n 


[6] S= RF (x4, Xo, ee Lp)» 
A=1 


In S we shall write for the z’s all the systems of values where 
at least one « lies on the boundary of the realm in question. 
The realm of the 2’s is thus the totality of the z’s for which 


| teed is zero only when |a,,| = 0 
(A =1, 2, +5005 w= 1, yr) 


it follows then that [4] is not zero, since |a,,| is by hypothesis 
different from zero. 


SPECIAL CASES 157 


When one of the z’s reaches its limit, there is no system of 
values of the 2’s for which the function [6] vanishes, since the 
function can (as follows from definition [5] of the F, and the con- 
siderations of § 119) only vanish if all the y’s and consequently 
all the z’s vanish. 

There is then a lower limit G@ which is different from zero 
for the values of [6] which correspond to a definite system of 
values (21, +++, £,) of the boundaries. 

121. We come next to the determination of the limiting values 
of the y’s. For this purpose we consider the expression 


A=n 


[7] (A (, a) Lp) = Yr) « 
A=1 


If we ascribe definite values to the y’s, then there is for the values 
[7] in the realm of the z’s a system for which [7] is a minimum. 
We wish to show that this system of values of the 2’s does 
not lie upon the boundary of the realm. We prove this by show- 
ing that there is a point within the realm where the expression 
[7] has a smaller value than it has on the boundary. 
The expression [7] may be written 


A=n 


A=n A=n 
> —yP= S— 2 Fyn ti pare 
A= A= AZ1 
Since VS is at all events greater than /,, and consequently 


Ey 
az Al, 
it follows that S 
w=n _h=n _ =n =n =n 
DPI. < VS DH.<VS|Dy|<VS D|n.|<VS Yhw 
w=l1 w=l p=l w=l p=1 
where the h’s are the limits of the y’s. From this it results that 
=n w=n =n 
> (F.- Gaye Ea 2VS hd yi 
w=l1 w=1 p= 


and, consequently, for a greater reason 


[8] DF _ y,)2 > 8—2VS DE hy 
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The limits h, must be so chosen that the right-hand side of 
[8] is positive. This choice can be made so that the expression 
on the right-hand side for a system of z’s which belongs to the 
boundary does not become arbitrarily small but always remains 
greater than a certain lower limit (see the preceding section). 

The expression, however, on the interior of the realm of the 
zs may be arbitrarily small, viz, when 2,=%,= +--+ =4%,= 0. 

For this system of values the left-hand side of [8] is equal to 

eer 
pe 
w= 

We have therefore found the following result: We can give 
limits g to the variables x, and to the y’s the limits h, in such 
a way that the expression [7| for systems of values of the 2’s 
which belong to the boundary of the realm is always greater than 
it 1s for the zero position (4%, = %,= --- =%,=0). 

Hence the position for which the expression [7] is a minimum 
must necessarily lie within the realm of the xs; and we may be 
certain that within the realm of the x’s there is a position where 
[7] has its smallest value. 

122. In order to find the minimal position of [7] which was 
shown to exist in the previous section we must differentiate the 
function [7] and put the first partial derivatives equal to 0. 

This gives 


v=n oF 
‘ = = — DSS. 
[9] PE Y,) 0x, (mo ile 2, A N). 
These n equations can, in case the determinant 
oF, 
[10] a, v=, By 2's 55 90S w=, yeaa nN) 


is different from zero, exist only if the quantities within the 
brackets vanish. 

[10] is identical with the determinant [4]; and (see § 119) 
it may be always brought about through suitable choice of the 
limits g of the «’s that [4] is different from zero if only the de- 
terminant [2], as by hypothesis is the case, is different from zero. 
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Hence the equation [9] can only be satisfied if 
[1a] or [5] YW =f, (x, Loy su Aaa Ly). 


We have therefore found that, since there is certainly a system 


v= 


of values of the x's for which thé function > y, — F,)* is a mini- 


mum, there must also be a system within He a of the «xs for 
which the equations [1a] are satisfied if to the y’s definite values 
in their realm are arbitrarily given. 

123. We must further see whether within the fixed realm 
there is one or several systems of values of the a’s that satisfy 
the equations [1a] with prescribed values of thé y’s which lie 
within definite limits. 

To establish this, assume that (aj, 2,---, x.) is a second 
system of values that satisfy the equations [la]; we must then 
have the equations 


[11] F, (%4; py ae In) — F, (#4, Har°* %) Lp) = 0 (v=1, 2, a das 1). 
Developing by Taylor’s theorem, we have, when we consider only 
terms of the first dimension, 


p=n 


[lla] 2 — a) { = 


The xe , are functions whieh depend upon the #’’s and «’s and 
vanish with these quantities. 
We may determine the n unknown quantities 2 — a, %— #, 
., v/ — x, from the n linear equations [11@]. 
For small values of z and «' the determinant 


Cp ca Le Agnetcnesago) 


will be little different from the determinant [10]. 

We may therefore make the limits g of the a’s so small that [12] 
is different from zero for all the 2’s and «”s which belong to the 
realm; and when this has been done, the equations [11] are only 


satisfied for =a, (p=1, 2,--:, 2); 


vp 


[12] 


that is, there ewists within the realm in question no second system 
of the x's which satisfies the equations [1a]. 
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We have therefore come to the following result : 

It is possible so to determine the limits g and h that with every 
arbitrary system of the y's in which each single variable does not 
exceed its definite limiting value, the given equations [la] are 
satisfied by one and only one system of the x's in which these 
quantities likewise do not exceed their limits.* 


The first part of the theorem given in § 118 is thus proved. 


Remark. We have assumed that we have to do only with real quantities. 
The discussion, however, is not restricted to such quantities, as it is easy 
to prove that the same developments may also be made for complex 
variables. 


124. The values of the «’s which were had from the equations 


p=n 


[1 a] = Z(t +X, ,)% (v == iis 2. see, 2) 


may be derived in the manner given in § 118. 
If we write : 


+X, 


the linear solution of the equations [1 @] is 


= A! @=1, 2,---,n; p=1, 2,---, ), 


v=n yg! 


[3 6] “=> EY, (pe =1, 2.0%, 2), 
v=1 A 

where Ae denotes the corresponding first minors. Now J! is a 

definite quantity which lies within certain finite limits; the same 


1 ik : ime 
is also true of wie A,, is found in a similar manner. Hence 


eee ee: 
the quantities rib are finite quantities which lie between definite 


limits; and, therefore, if the y’s become indefinitely small, the 
«’s will also become indefinitely small; that is, those systems of 
values of the x’s which satisfy the equations [1] under the named 
conditions are, as has also been shown in § 119, so formed that 
they become indefinitely small with the y’s. 


*See also Hadamard, ‘Sur les transformations ponctuelles,” Bull. de 1 Société 
Math., Vol. XXXIV, 1906. Pp ) a Société 


SPECIAL CASES 161 


We may now show that the 2’s are continuous functions of 
the 7's. 

Let (b,, b,,++-, bn) be a definite system of values of the ys, 
and let the system (a,, d,,+++, @) of the a’s correspond to this 
system of the y’s. 

If we then write 
(y= b+ ™ | 
< 


13 
[13] | @= a, + & | 


(A=1, 2,°++, 2), 


the system of equations [1a] or [1] goes into 
y+ b= F(a, + &, Gg+ &,--*, On +6,) (=1, 2,---, 2), 
[18] m= Fy + Sy Mat Fa, +003 Ont En) — Fy (ays Ags +++) Gn) 
(A Sal Scere = 10) 
Developing this expression according to powers of the &’s, we have 


p=n 


[1 c] UN => Orne (A = ip 2; Aeaked N), 
B=1 


where the Ca are functions of the a’s and és. If the &s are 
indefinitely small, we may limit the Cj, to the first derivatives 
of F,. In this case we denote the coefficients of [1¢] by C,,, so that 


OF, 
Ch, = =— for (a = ay, %y = Aq, >> +, Ly = A) 
OL 
A=) 2,° 9 15 4, Li, s0'%, N), 
and the determinant of the equations [1c] goes into 
OF, 


for (a4 Sp — Co en An) 
be ’ 
(Nile Dae oe a th eyo 25 %) 

If the 2’s lie within definite limits, this determinant remains 
always above a definite limit. We may therefore say that the 
determinant has a value different from zero. Consequently the 
condition that the equations [1c] may be solved is satisfied, and 
it is seen that indefinitely small values of the &’s must correspond 
to indefinitely small values of the 7’s. 

This means nothing more than that the functions x are con- 


tinuous functions of the y’s. 
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125. The above investigations are true under the assumption 
that the functions F, are continuous, that their first derivatives 
exist and likewise are continuous within certain limits. We need 
know absolutely nothing about the second derivatives. 

Of the z’s, of which it is already known that they exist as 
functions of the y’s and vary in a continuous manner with them, 
it may now likewise be proved that they, considered as functions 
of the y’s, have derivatives which are continuous functions of 
the y’s. 

The proof in question may be derived from the following con- 
siderations: If from [1c] we express the &s in terms of the 7’s, 
we have 


[3¢] e a 


LN ae (uw =1, 2,°+°, 2). 


The Aue are continuous functions of the &s, and the &s are 


; : A 
continuous functions of the »’s. Hence —“ 


as continuous functions of the 7’s. 
If the 7’s become indefinitely small, then the &s become 


may be represented 


indefinitely small, and we have definite limits for Ann 


In general, if we have a function f(«,, +--+, #,) of the n variables 
X14, %q,+++, ,, and if we consider the difference 


J (a a é, Hs a eS = * 95 (Gy ss Ee!) — f(a; Mp, °° 5 An), 


it is seen that it may be written in the form 


A=n 


DOr +A) & 

A=1 
where the H, depend upon the &’s and become indefinitely small 
with these quantities, and the }, are, in virtue of the definition 
of the differential quotient, the partial differential quotients of f 
with respect to x, for the system of values (4, +++, 4). From 
the above it results not only that the 2’s are continuous func- 
tions of the y’s but also that the derivatives of the first order 
of these functions exist. 


SPECIAL CASES 163 


We have, indeed, the derivatives of the first order if in the 
. A : 
expressions aN we write the &s equal to zero. 
The quantities ae however, become then, in accordance with 


[1¢], the quantities which we should have in [1c] if we had at 
first written C,, instead of C\,,. 
But the quantities C,, are continuous functions of a, a,,+++, a 


ne 


We may therefore say that the differential quotients a are con- 


tinuous functions of the variables #, and it is then proved that 
the «’s are such functions of the y’s as the y’s are of the z’s. 
126. For the complete solution of the second part of the 
theorem in § 116 we have yet to show that the expressions [36] 
may be reduced to the form [3a]. ' 
For this purpose we must bring the quantities ie in [30] 
(§ 124) to the form : 


where 0,, is the value of “as when all the z’s are equal to zero. 


sy S is a function of the z’s, but the 2’s are functions of the y’s, so 
that Y is a function of the y’s which vanishes when they vanish. 
be 
We may therefore in reality write [35] in the form [3a] 


A=n 
[3 a| By => (ix aan VOLE (Mm = il 2. SEs N). 
A=1 


127. There may arise cases in which we know nothing further 
of the functions 7, as was assumed in § 116, than that they are 
real continuous functions. 

We cannot then conclude, for example, that the z’s may be 
developed in powers of the y’s; but we may reduce the equations 
to the form [3a] and show that the equations [1a] are solvable. 

The theorem which has been proved is of great importance 
when applied to special cases, even for elementary investigations. 

If, for example, the equation f(z, y)=9 is given, then it is 
taught in the differential calculus how we can find the derivative 
of y considered as a function of «. 
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If we assume that the variables ~ and y are limited to a special 
realm where the two derivatives with respect to x and y do not 
vanish and therefore the curve f(z, y)=0 has no singular points, 
and if the equation is satisfied by the system (x, Yo), we may 
write <=a%,+& y=yt7. We have then f(x + & yy +)=9 
and we may prove with the aid of the theorem in § 118 that 7 is 
a continuous function of & and has a first derivative. Not before 
this has been done have we a right to differentiate and proceed 
according to the ordinary rules of the differential calcilus. 


MISCELLANEOUS PROBLEMS 


1. Show that the problem of determining the extremes of the function 
J (& y) may be reduced to the determination of the upper and lower limits 
of this function under the condition that 2? + y?=r% (Stolz, Wiener Ber., 
Vol. C, p. 1167.) 


2. Find the shortest distance from the point P(2,, ¥,, 2,) to the plane 
Az + By+ Cz+D=0. Az, + By, + Cy +D_ 

3. Find the points on a given sphere which are the farthest from and 
nearest to a given plane which does not intersect the sphere. (Pappus.) 


Answer. 


4. Find the triangle of maximum area whose vertices V,, V,, and V, 
describe respectively three given plane curves C,, C,, and C;. When the 
three curves reduce to the same ellipse, show that there are an infinity of 
triangles of maximum area (a case of improper maximum). 


5. Find the ellipse of least area that may be drawn through the three 
vertices of a triangle. 


6. Find the ellipsoid of least volume which may be drawn ap the 
four vertices of a tetrahedron. 


7. In a triangle of greatest or least area circumscribed about a curve, 
the points of contact are the mid-points of the sides. 


8. Among the triangles whose vertices are situated respectively upon 
three given straight lines in space, which is the one whose perimeter gives 
a maximum or minimum? Also determine the triangle of maximum or 
minimum area. 

Answer. In the first case the bisectrices of the triangle are respectively 
normal to the straight lines described by the vertices; in the second case 
the altitudes of the triangle are perpendicular to these lines. 


SPECIAL CASES 165 


9. Upon a fixed surface find a point P such that the sum of the squares 
of its distances from n fixed points A,, A,,:-:, A, is a maximum or a 
minimum. 

Answer. If the tangent plane at P is taken as the zy-plane and the normal 
to this plane at P as the z-axis, the center of mean distances M, say, of the 
points A lie upon the z-axis. It follows that the points P are the feet of 
the normals which may be drawn to the surface from M. 


10. Show that the semi-axes of a central section of a quadric 
A,x? + Aoy® + Anz? + 2B yz + 2 Bi zx + 2B zy +1=0 


are the roots 7? of the equation 


Peete be, Bs, l 
Be. be Bas 7m) 5 
B,, voen TEA et ne 
l; m, n, 0 


where the section is made by the plane 
le + my + nz = 0. 


11. Show that the axes of the quadric of the preceding example are 
the roots of the following cubic in 7?: 


Aen Ber) Bere 
Bef, ee Ale bE — (0), 
Bon, Bates 1+ A,r? 


12. z=4v—yis to be a maximum, where y?— nyz + x? = 0 and v—x=y. 
(Hudde, 1658. See Descartes, Geom., Vol. I, pp. 507-516.) 


13. The fundamental theorem of algebra. Let f(t) be an integral function 
of ¢ with constant coefficients. Write t= x + iy, so that 


(1) SQ =P(azy) + 1, Y=P + iQ, 
with the identical relations 
? aP _ aQ eP__aQ 
(2) ox = oy end oy Or 


Form the expression p(a, y) = p = P?+ Q®. Within the circle of radius 
r= V2? + 7? the function p is everywhere continuous, so that (§ 8) the 
function » must reach its lower limit for values of « and y within or 
on the boundary of the circle. By taking r sufficiently large it is seen 
that the lower limit of 4 must be reached within the circle, so that there 
must be a minimum value of p. Show that this minimum value is zero, 
and consequently that there must be some value of ¢ which makes /(¢) = 0, 
provided that f(t) is not a constant. In particular the semi-definite case 


must be considered. 


CHAPTER VIII 


CERTAIN FUNDAMENTAL CONCEPTIONS IN THE THEORY 
OF ANALYTIC FUNCTIONS 


I. ANALYTIC DEPENDENCE; ALGEBRAIC DEPENDENCE 


128. If in the development of the conception of the analytic 
functions we start with the simplest functions which may be 
_ expressed through arithmetical operations, we come first to the 
rational* functions of one or more variables. The conception of 
these rational functions may be easily extended by substituting in 
their places one-valued functions, and first of all those which may 
be expressed through arithmetical operations, viz., sums of an 
infinite number of terms of which each is a rational function, or 
products of an infinite number of such functions. 

Such a transcendental function is, for example, 


0 (@) = Uy (L) + Uy (@) + ++ + Uy (@) H+ Up gs (4) +s, 


where u;(#)[¢=1, 2,---] are rational functions of w The 
necessity at once arises of developing the conditions of con- 
vergence of infinite series and products, since such an arithmetical 
expression represents a definite function only for values of the 
variable for which it converges. Mere convergence, however, is 
not sufficient if we wish to retain for the functions just mentioned 
the properties which belong to the rational and the ordinary 
transcendental functions. All such functions have derivatives, 
and we shall restrict ourselves once for all to functions which 
have derivatives. 

Furthermore, the derived series of the above expressions of one 
variable must converge wniformly (gleichmdssig) in the neighbor- 
hood of each definite value, and every term of the derived series 


* See Hancock, Elliptic Functions, Vol. 1, pp. 6-9. 
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must be continuous in the same neighborhood. (Osgood, Lehrbuch 
der Functionentheorie, p. 83.) 

129. When we say that a series whose terms are functions of 
one variable converges uniformly, we mean the following :* It is 
assumed that the series in question has a definite value for x = x». 
We consider all values of z for which x —, does not exceed a 
definite quantity d. This determines a fixed region for z, within 
which we shall suppose that the series is convergent. This region 
is known as the region of convergence (Convergenzbezirk). We 
may, for brevity, put 


Ry(@) for Upsy t+ Meta + o+> 


in the series above. In order that this series converge uniformly, 
it must be possible after we have assumed an arbitrarily small 
- positive quantity 6, and when a remainder &,(a) has been sepa- 
rated from the series, to find a positive integer m so that 


| 2, (x)|=6, where k>m 


for all values of « within the region of convergence.t 

130. Proceeding in this way we may form more complicated 
expressions; for example, we may let w(#) be a sum of an infinite 
number of terms where each term is a transcendental function 
like v(a) above, so that 


w(a)= ry (@) +m (@) ++ 


We may continue by forming similar expressions out of the 
transcendental functions w(«), ete. It is clear that if we proceed 
in this manner, there is no end of such expressions, so that even 
if we limit ourselves to one-valued functions, we do not obtain a 
clear insight into the possible kinds and forms of such functions. 

It is essential that all such transcendental functions have a 
common property, and we note that if we take a value x) within 
the region of convergence in which the series representing these 


* Weierstrass, Collected Works, Vol. II, p. 202, and Zur Funetionenlehre, § 1. 
+ See Dini, Theorie der Funktionen (page 137 of the German translation by Liiroth 


and Schepp). 
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functions converge uniformly, they may be represented for all 
the values of zx in the neighborhood of z, as series which 
proceed according to positive integral powers of x—a,; for 
example, in the form 


F(a) =F (a — Ly + Lo) = Ayt a, (@ — Lo) + ay (x — Ip)?+ oni 


where dp, 4, 4,++-+, are definite functions of z. From this it 
follows that they may be differentiated, and a number of other 
properties are immediate consequences. 

131. We may next extend the conception of uniform conver- 
gence to functions of several variables. With Weierstrass (loc. cit.) 
consider the infinite series 


v=@0 
F(a, a9, sine Ly) => 4, (X, X95 oe Ly) 
v=0 


whose terms uw, are functions of an arbitrary number of variable 
quantities 7, %,-++,%,. Such a function converges uniformly in 
any part (R) of its region of convergence when with a prescribed 
quantity 6 chosen arbitrarily small there exists a positive integer 
m such that the absolute value of 


v= 0 


>% (21, Ha,°% %5 Ln) =6 
v=k 


for every value of k which is = m and for every system of 
values of 2, %,++-+,%, Which belongs to (R). 

Let a, @2,+++, @, be a definite system of values of the vari- 
ables #,, %,+++, Z, within the region of uniform convergence, and 
consider only the values of «,, 7, +++, x, for which L1— Ay, Ly— Ag, 

+, Z,—4@, do not exceed certain limits dj, do, ++ -,d,, as in § 129. 

The function may then be represented through an ordinary 
series which proceeds according to integral powers of a —ay, 
d_— Ug, Hz3— Ag, ++ +, Ly— Ay, and consequently may be differentiated; 
in short, it behaves, as Weierstrass * was accustomed to express it, 
like an integral rational function in the neighborhood of a definite 
position within the interior of the region of uniform convergence. 


*In this connection see Weierstrass, Werke, Vol. II, pp. 135 et seq., and also Bier- 
mann, Theorie der Analy. Funktionen, pp. 429 et seq. 
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132. We may next introduce the conception of analytic 
dependence. If we represent a function which has been formed as 
indicated above by F(a, a%,-++, v,), then F(a, a, +++, Ep 0 
expresses a certain dependence among the variables 1, %,- ++, x); 
that is, among the infinite number of systems of values for 
which the function has a meaning, those only which satisfy this 
equation are to be considered. There exists, therefore, among 
2, %,+++, Z, a dependence of a similar character, as in the case 
of algebraic equations. If we choose the quantities 7, %, +++, #, 
such that the equations 4,=0, #,=0,---, #,=—0 exist where 
m <n, we have a dependence among the quantities x, 2%, +++, %, 
defined in such a way that at all events we can choose arbitra- 
rily not more than 7 — m of the variables, since the remaining 
m variables are determined. 

133. The conception of the many-valued functions is at once 
suggested. Suppose, for example, a function of two variables x 
and y is given; then we may consider all the systems of values 
(z, y) in which « has a prescribed value. For such a value of « 
there may exist several values of y. We are to regard y as a 
function of x, and this function is a many-valued function. By 
the introduction of one or more auxiliary variables it is often 
possible to express the many-valued functions* through one- 
valued functions, and indeed in algebraic form. The development 
of analytic functions from an arithmetical or algebraic standpoint 
seemed especially desirable to Weierstrass. He wrote (see Werke, 
Vol. II (Oct. 3, 1875), p. 235): “The more I consider the under- 
lying principles of the theory of functions —and I do this con- 
tinually —the stronger am I convinced that this theory must be 
built upon the foundation of algebraic truths.” 

134. To illustrate the remarks of the preceding article, con- 
sider any analytic dependence existing between, say, two variables 
x and y and limit one of the variables ~ to a definite region. 
The other variable must be expressed through w and in a form 
that remains invariably true for all values of « in question. Now, 
if to the one variable there corresponds a transcendental function, 


* We might cite, for example, the Abelian transcendents. 
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it does not seem possible to express one variable arithmetically 
in terms of the other. We may, however, introduce a third aux- 
iliary variable and thereby express both of the original variables 
as one-valued functions of the third variable and in such a way 
that, if we give to this variable all possible values, we have all 
systems of values of (x, y). 

The simplest example is perhaps the one given by the equation 
z=, where z and y are two independent variables. It is not 
possible to express the dependence between # and y in an arith- 
metical form; that is, one in which transcendentals do not appear. 
But if we introduce a third variable ¢, and write z= e‘, we have 
z= e¥', so that eee 

Thus # and y are expressed as one-valued functions of ¢, and 
such that for one value of « there is invariably one value of ¢ 
and of y. Poincaré* proved that if x and y are connected by an 
algebraic equation, then all systems of values (z, y) may be 
expressed in the form just indicated. He also showed that if 
any analytic dependence exists between # and y, it is always 
possible to represent « and y as one-valued functions of a third 
variable. An example in point is the expression of the integrals 
of linear differential equations through the Fuchsian functions. 
However, he did not show in this latter case that all the points 
of the region in question were thus expressed through ¢. On 
the contrary it seems that there exists an infinite number of 


isolated points which can be reached only when ¢ tends toward 
certain limits. 


For example, in the differential equation of the hypergeometric 


series, we should have to exclude in such a representation the real 
values of # from +1 to +00. (See the Paris Thesis of Goursat.) 

In this manner the study of many-valued functions may be 
reduced to the study of one-valued functions. However, it is not 


*See Bulletin de la Société mathématique de France, Vol. XI (1883). See also 
lectures II and III, delivered in the Cambridge Colloquium, by Professor Osgood 
(Bulletin of the Amer. Math. Society, 1898); and the Problémes mathématiques of 
Professor Hilbert in the Comptes rendus of the Congress of Mathematicians, Paris, 


1900. In his treatment of Algebraic Functions of Two Variables, Professor Picard has 
done valuable work in this connection. 
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a simple task, for if we wish in reality to make this representation 
even in the case of a linear differential equation, we encounter 
many technical difficulties. Nevertheless, it is essential to prove 
that there exist such representations. 

Weierstrass asserted (May, 1884) that he believed the follow- 
ing theorem existed in the theory of functions: J¢ is always 
possible, where an analytic dependence exists, to express this 
dependence in a one-valued form which remains invariably true. 

135. We may next introduce the following theorem, which is 
extensively used, particularly in the calculus of variations: 


Suppose that between the variables x1, %,+++, % we have m 
equations given which may be represented in ule Jorm of power- 
series, and let these be 


1,1 (1 — a)+ ee: + Cin (Dl Oge) )+X, = 0, 
€2,1 (a — ay) + aes + Co, oo RO Sp = = 0, 


Cm, 1(%1 — 4) + +++ £m, n (Ln — Gn) ee 0 (m<n), 


where xX: x: ae Ne are also power-series of a%,— 4,+++; 
Ly— Ayn, but of such a nature that each term in them is of a higher 
dimension than the first. 

The equations will be satisfied for a= dy,+++, L,= a, We 
propose the problem of determining all systems of values 
(a4, La,+++, %,) which lie in the neighborhood of (a4, ,+++, Gn) 
and via satisfy the m equations above; that is, among the 
systems of values for which | a,— a4, +++, |%,_— On| are smaller than 
a fixed limit p, determine those which satisfy the m equations. 


The quantity p is subject to the condition only of being sufti- 
ciently small. To solve this problem we consider the system of 
linear equations to which the given equations reduce when we 


write X,= 0, X,= 0, sae, X,,= 0. 


Through these linear equations m of the differences 2 — a, 
i he —m 
g— Wg, +++, Lm— Fm May be expressed in terms of t 
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remaining, if the determinants of the mth order which may be 
formed out of the m rows of the ¢’s are not all zero. 


If, say, (1, fn 7% 5 C1, m 
- my = 0, 


Cm, ih Ares n,m 


we have (§ 117) 


! 
My = Ch, iene Am +1) Speirs Went Cae (t_— n)+X 4 


= 
hy — Ag= C4, 1 (Lm 41 — Om +1) a ile dae C, n A Ay) + <5 


nf) ! By, bone rf 
Ln — bm = Cm, 1 (oe Am +1) St a Cm, n—m (Zp hn) +X 


m 


By means of these equations we may represent #;— a, %— My, 
++, L_— Gm aS power-serles in the remaining n — m differences, 
the formal procedure being as follows: 

We write xX = 0, - 2s, GS 0, and thus obtain for x, — a, 
+++, Lm — py. expressions which represent the first approximations. 
These are substituted in Xe ny NS and in the resulting ex- 
pressions only terms of the second dimension are considered. 
These terms added to the terms of the first approximations respec- 
tively constitute the second approximations. Continuing this 
process we may represent the required expressions to any degree 
of exactness desired. 

We obtain the same results if we express m of the quantities 
Ly — A, Ly— Ag, +++, L,— a, through power-series in terms of the 
remaining n — m quantities with indeterminate coefficients. These 
coefficients may be determined without difficulty. 

As just shown, these power-series are convergent as soon as 
the differences «—a which enter into them do not exceed cer- 
tain limits, and, furthermore, these power-series satisfy the given 
equations. 

136. The problem of the preceding article may be solved in 
the following more symmetric manner, in which none of the vari- 
ables is given preference over the others (see Lagrange, Théorie 
des Fonctions, Vol. II, § 58). 
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Besides the equations given above we introduce others which 
are likewise expressed in power-series : 


Let m43,1(%— %) + +++ + om tin (Bn — In) +X, = by 


Ons (2, 7 ay) cima si: Cn,m (Ly — Gy) +X.= Lens 
where ¢,, t,+++, t,_m are m — m new or auxiliary variables. 


The quantities ¢ are arbitrarily chosen, in such a manner, 
however, that the determinant 


, Lb Cy, 29 atta s Cy, m q, m+ ay: Cy, n 
C9, Lb C9, 2 evan C9, m) C9, m+ Or C9, n 
Cn, L Cm, 2 BASIE Cm, ms mn, m+ RES) Cn, n = 0. 


Cn+1,1 Sm41,29 °° *> Sm+1,m> Cntl,mtl ** "> Cm+1, n 


C ¢ 


' r 
Cn, 1 Cn, 2 sty Cn. ms n,m+19 te EO 


Proceeding as in § 117 we write the quantities X equal to 
zero, and we thus have a system of » linear equations through 
which we can express the n differences #,— a, +++, %,— 4d, 
through 4, t2,++°, tm, Say, 

Ly — 0, = by ity + 6,,ota + +++ 6,,n—mtum +X, 
V=H1, 2535 +) 2): i 
With the help of these equations we can express 2 — ay,+--, 
Ly, — Gy, aS power-series in ty,+++, ty _m- 

To do this we again write X= 0, and have only terms of 
the first dimension. We write the first approximations that have 
been thus obtained in <i and by retaining the terms of the 
second dimension derive the second approximations, etc. 

It follows directly from the above that these power-series in ¢ 
formally satisfy the given equation; that they possess a certain 
common region of convergence if we give certain fixed limits to 
[t1|, |éel,+°°, |t»_m|; that they consequently in reality satisfy the 
equations ; and, finally, that all the systems of values (1, %, +++, Ly) 
which lie in the neighborhood of (a, a, +++, @) and which 
satisfy the proposed equations are obtained in this way. 
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137. Suppose that between two variables there exists an ana- 

lytic relation which is expressed in the form 

P(t — Hey ¥ — Yo) = 9 
where P denotes simply a power-series and where 2, Y 18 a 
definite pair of values of the variables. 

In the neighborhood of (<p, y)) there is an infinite number of 
systems of values which satisfy the equation. The collectivity of 
these pairs of values («, y) is called an analytic structure, or con- 
figuration (Gebild), in the realm (Gebiet) of the quantities (x, y). 

We may next make an application of the theorem of the pre- 
ceding article. It follows that, if between 2 quantities 7, 2», 

-+, #, there exist m equations in the form of power-series, 
then the differences a, --a,+++, %,—@, may be expressed 
through power-series of the 7 —m remaining variables. Weier- 
strass said: “ Through the m equations a structure of the (n—m)th 
kind in the realm of the m quantities 7, %,-+-, Z, is defined.” 

As in the case of two variables, we may proceed in a similar 
manner with several variables, among which an analytic depend- 
ence exists. Let this connection be of such a nature that m (<n) 
of the variables are in general determined through the remaining 
n—m. If, then, (a, +--+, @,) represents a definite system of 
values of the variables, there exist m equations of the form 


P(t — m4, He — a, +2, Ln, — An) = 0 


which are to be satisfied for x, =a), %2=d9,-*+, %,=a,. In the 
neighborhood of the position (a, @2,+++,@,) there are, then, an 
infinite number of other systems of values (2, 2%, ++, XL») Which 
satisfy the same m equations. “These define an analytic structure* 
wm the realm of the quantities 21, X, +++, L). 

A fundamental theorem in the theory of functions of the 
complex variable is that these structures may be continued over 
their boundaries. The power-series above constitutes an element 
of a complete structure (§ 97). 


* Weierstrass, Werke, Vol. II, p. 236. It may be remarked that Minkowski in his 
Geometrie der Zahlen advances similar ideas at considerable length. See in particu- 
lar § 19 of his work just mentioned. 
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138. Analytical structures, as above defined, may be represented 
in a different manner. If the equation connecting x and y begins 
with terms of the first dimension, we may, on the one hand, either 
express y—Yy through P(x— a) or x— x, through P(y—y); 
or, on the other hand, if the coefficient of either x — Lae OF 
Y¥—Yo 18 equal to zero, it is possible to express only y—y, 
or only «— x, as integral power-series of «— ax) or y—yo. In 
order that this distinction may not be necessary, we introduce 
a function ¢ which begins with terms of the first dimension in 
L2— Ly, ¥Y — Yo (See § 136); we may then always express the two 
quantities #, y as power-series of ¢. Through the introduction of 
such a quantity ¢ it is made possible to include within certain 
limits all the systems of values («— «x, y — yo). which satisfy 
this equation. These values must firstly satisfy the given equa- 
tion, and secondly they must afford all the systems of values 
which satisfy it within these limits. 

These considerations may be extended at once to equations in 
several variables. If we have a certain number of equations in 
Ly — Ay, Ly— Ay, +++, 2, —Ay, and if we limit these equations 
to terms of the first dimension, we have linear homogeneous 
equations of the first dimension, the number of which we assume 
to be m(< 1). 

If we can express m of the quantities 2 — a), %—g,++-, 
2, — a, through the remaining n — m, it is always possible so to 
_ derive n power-series of m—m quantities ¢, t,-+--+, t,_m that 
they, substituted for 2, %,---, %,, firstly satisfy the given equa- 
tions, and secondly, if we give to 4, t,++-, t,_m all possible 
values, they offer all the systems of values (a, %,+-++, %,) which 
satisfy those equations, when certain limits are fixed for the abso- 
lute values of 2 — 4, %— dg,+++, LZ, — 4,3 or, also secondly, that 
with indefinitely small values of the ¢’s they afford all the systems 
of values of the quantities a, %,-+-+-+, %», which lie indefinitely 
near the position (a, d,-+++, @)(see again § 136). 

Take n power-series ®, (t), Po(t),---, ®,(¢) and write x, =P, (i), 
% = B,(t),-+++, % = ®,,(¢t); then through these equations a struc- 
ture of the first kind (Stufe) in the realm of the n quantities a 
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is defined; in a similar manner a structure of the second kind is 
defined through the equations 


Hy = ®, (¢,, te), ky = ®, (t,, te)» SA Ne, Ln = ®,, (t,, te)» ete. 


In general, if we take m power-series in ¢,, tg,+++, t,_m and 
write these equal to 2, #,-+-+, 2», the collectivity of the sys- 
tems of values (#,, %,+++, %) offered through these equations 
constitute a structure of the Casa kind in the realm of the 
quantities 1, Ly,+++, Ly. 

We shall in the sequel limit the discussion of the general 
analytic dependence to the cases where this dependence is 
expressed through algebraic equations and to the structures 
which result from such equations, viz., the algebraic structures. 


Il. ALGEBRAIC STRUCTURES IN TWO VARIABLES 


139. Let F(x, y) be an integral algebraic function of « and y 


which does not contain repeated factors, so that F(#, y) has no 


common factor with either = or -. Further suppose that 
2 ¥ 


F(a, y) is not divisible by any integral function in which appears 
only one of the variablés « or y. The system of values x, y which 
satisfy the equation F(z, y)=0 form the algebraic structure that 
is defined through this equation. 

Tf 2, Y is a pair of values such that F(x, y¥)=0, we may 
develop the equation F(x, Y) in powers of #— a, and y—y, in 
the form (cf. Stolz, loc. cit., p. a 


[1] G(E, ) = dF (xp, w+ 5 OP (%, Yo)+> 
ee 
+ nie a Y)t++-=0, 


where for brevity we put «— a= £, y— y)= 7, and where a" Fao, Yo) 
is the homogeneous function of the nth degree in &, n, viz., 


| : i (n\_ or 
[2] a" F (xp, w=3(; oe Page 


CERTAIN FUNDAMENTAL CONCEPTIONS shed 


If ae and - do not both vanish, the position (or point) x», y, is 
said to be regular or simple. But if they both vanish for xz = a, 
Y¥ =Yo, and if for the same position all the partial derivatives of 
the 2d, 3d,.--, (t —1)st order of F(x, y) with respect to x and y 
vanish, while those of the Ath order are not all zero, the position 
Ly, Yo 18 called a singular position, and, specifically, a k-ple 
singularity. In such a case the left-hand side of equation [1] 
begins with terms of the Ath order with respect to & and ». 

In the following treatment not only the integer / plays an im- 
portant réle but also the smallest exponent of the terms that are 
free from 7, as also the smallest exponent of the terms that are free 
from &, on the left-hand side of [1]. If we denote the first by p 
and the second by g, the equation [1] may be written in the form 


[3] F (%+ E, Yo+ 0) 
= EP fa t+ Ef(E)}+ 07 {b + ng (n)} + En AE, ») = 0. 


Here /(&) denotes an integral function of € and g(n) an integral 
function of 7; @ and } are constants different from zero, viz., 


140. Developments of the algebraic function y in the neighbor- 
hood of a regular position. It may be shown * that 7/ on the position 


. Ie 
L= Lo, ¥ = Yo the expression - does not vanish (# that, say, q =1 
and b= ee , there is one and only one convergent series in integral 


positive powers of & which vanishes with E and which substituted 
for n in [1] identically satisfies [1]. 
We may suppose that this series begins with &?, so that 


a 
[4] a SEP bey oo 
We have also to consider in the sequel fractional positive 
1 


powers of z—a, = &; that is, powers, say, &, where w+ 1. A series 


* See Pierpont, Vol. I, p. 288; or Goursat, Cours D’ Analyse, Vol. I, chap. iii. 
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of this kind is convergent if there is a positive quantity R such 
1 


that the series for all values of |&“|< & is convergent, and that 
is for all values of |&|< R*. 

If the series is convergent for one of the » values of the wth 
root of £, it is evidently convergent for all the other » —1 values 
of & Accordingly, to each of the values of € whose absolute value 
is smaller than R“ there correspond yp different values of the series. 


If, for example, we denote a definite one of the values of &, for 
1 


example, the principal one, by &*, the others are expressed through 
1 
the product j&", where j is any of the wth roots of unity. 
Hence a series 


5] se on(VE)" (A>) 


n=A 


may, corresponding to the different values of j, appear in the » —1 
other forms ri 


> eng ( VE)" 


n=A 


The theorem stated at the beginning of this article may be 
generalized: If on the position «= x), y= y, the expression = 
2 


does not vanish, there is one and only one convergent power-series 
in positive integral or fractional powers of & which vanishes with 
— and which written for 7 in the equation [1] identically satisfies 
it, viz. the series [4]. For if besides the series [4] a series [5] 
with ~ >1 satisfied [1], then the equation 


1 
[6] F(t) +t", yy +n)=0, where t= &, 


would be satisfied by two series which have no constant term and 
in which 7 is expressed in integral powers of ¢. This, by the previ-- 
ous theorem, is impossible, because in [6] the term in 7 really 


appears, and in fact multiplied by the coefficient ae 
OF ar , “Yo 
141. Suppose next that a 0, but that an # 0, so that p=1 
is 


0 
and q¢>1. Then from what we have just seen it follows that the 
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equation [3] may be solved through only convergent series in 
which & is expressed in powers of 7 in the form 


b 
[7] ee ea A re On) Say, 
where g>1 and d,# 0; in other words, there exists a positive 
quantity S such that if |n|< S, we have the identity 


[8] G(Q(n), 0) = 9. 
Write € in the form 


b ae b 
—=— * | 1 yee <x |=- 5 wi) say, 


and note that 
1 


1 d. i 
[4 () J? =(1- pt ‘ =] — m n* + terms of higher order. 


If then we put 


a 
[9] t= n[O(n)]0=9— Fat 


by reverting this series we have 
[10] Dat SEO Pl, say, 
i 


and from this it is seen that a positive quantity AK may be so 
determined that for all values of ¢ such that |t}< A the above 
power-series in ¢ converges. This power-series when written for 
in the equation [9] identically satisfies it. 

If, further, we raise the equation [9] to the gth power and 


multiply it by — ib we have 
a 
b b b 
—~ t= — ~99Q, (9) = —— 99 + dantt? + --- = @(m) above ; 
a a2 a 


and this equation will be an identical one if for 7 we write the 
power-series P(¢). The same is true of equation [8]; that is, we 


have the identity b 
G (- aot Pe) =O 
a 


for all values ¢ for which the series P(#) is convergent. 
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If we denote the radius of convergence of the series P(t) by R 
and put §=— t7, where ¢ is any one of the g-values of the qth 
a 


root of — ef we have the following theorem: 


Tf. \|é\|< ae so that the series 
a : 


[11] iy E48) - 


exist, 7 denoting any of the qth roots of unity, then this expression 
written for n causes the function G(E, n) to vanish identically. 

Furthermore, there is only one such convergent series in inte- 
gral or fractional positive powers of &, without constant term, 
which when substituted for 7 in equation [1] causes that equation 
to vanish identically. 

For if there were another such series in integral positive powers 

1 
of &, say, 
A 

[12] n=oh+--- (W21, 6% 0), 

then in the manner given above we could express &, and conse- 
1 

quently also €,through a power series in 74 which identically satisfied 


[1]; but besides the series [7] there exists no such series, and conse- 
quently there is no such series as [12] which is different from [11]. 


III]. METHOD OF FINDING ALL SERIES FOR ¥ WHICH 
BELONG TO A k-PLY SINGULAR POSITION * 


142. In equation [1] let d(x», y), d?F' (a, Yo), + + +, d® LF (xp, Yo) 


be zero, so that this equation becomes 


d. i ; 


u 
ae mi (Xo Yo) = 9, 
where N is the dimension of F(a, y) with respect to x and y. 


* Besides Stolz, p. 182, see also Puiseux, Journ. de Math., 1st Series, Vol. XV, 
p. 365; Picard, Traité etc., Vol. I, p. 392; Hermite’s preface to Appell et Goursat, 


Fonctions Algébriques ete.; Kénigsberger, Elliptische Functionen, Vol. I, p. 187 
et seq.; ete. 
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There is, consequently, a #-ple singularity at xj, y), and we shall 
next show that we may derive all those convergent series without 
constant term which proceed in integral or fractional powers of £ 
and which when substituted for 7 in [13] identically satisfy it, if 
we can derive corresponding series for any simple, double, up to 
(& — 1)-ple position of any algebraic structure. In other words, the 
problem of deriving these series for a é-ple singularity is made to 
depend upon the derivation of such series for a position that is 
less than £-ple. 

If for 7 in the homogeneous function* of the nth dimension 


TAR Yo) =D (5) 1) 


(it being supposed not identically zero) we write the series [12] 
and arrange in ascending powers of &, then if \ = yw, this expres- 
sion begins at least with &”, and exactly with this term if ®,(1, ¢,) 
does not vanish. If \ # pw, this expression begins with &” only 
when this term in reality appears in ®,(&, ); otherwise with 
a term of higher or lower order than &” according as \ > yu 
or A< Bp. 
If in [13] we next decompose the lowest differential 


d* (xp, Yo) — ®,(E, ”) 


into its real or complex linear factors, we have 


r=l 


[14] ,(E, n) =[[@a 72 rey 
v=l 
where k,+h%,+++++h,=k and where one of the two coefficients 


a,, 8B, may be zero. 
Assuming first that \ = w, if in the above expression we write 


n = C,€ +- cleus! 
we see at once that for at least one value of 7 we must have 
HC), — [op — 0. 


* The method given by Weierstrass, Werke, Vol. IV, pp. 19 et seq., is essentially 
the same as that found here; see also Stolz, loc. cit. 


182 THEORY OF MAXIMA AND MINIMA 


For if this were not the case, then G(&, c.€+.---) would begin 
with. £* instead of vanishing identically. 

If, next, > i, one of the quantities 8,, B,,---, 8; must be 
zero; and if A<4, then one of the quantities @,, a,,--- » % must 


vanish. For if ea were all different from zero, then G(€, [nee -) 


begins with ee. 
If a series of the form [12], where ’ = y, satisfies the equation 
[13], we shall have, if in [12] we write 7 =(¢x + ,)&, a relation 
1 2 


between 7, and &, viz., 7;= Cn + eyo +--- 
The expression G(&, (¢,+7,)&) contains the factor &*, which 
may be neglected, so that 7, satisfies the equation 


G(—, m)= G(E, (c, +) £)= 0. 


1 
a 
If in the series [12] (when X% >) we write 7 = 7,&, we find that 
the equation 


1 . 
Gy tz ny) = gk is 4o)= 0 
is satisfied by the series _ 


AK 


mane Pe ++ 


If a series for » where <p satisfies [13], we revert the process 
and make the substitution & = n,. 

143. In giving the practical method of determining the series 
for » which satisfies [13] we must make a distinction between 
two cases: The function ®,(&, 7) either may contain different 
linear factors to their respective powers or it is the kth power of 
one single such factor. 

First case. Among the quantities a,, a,,---, a there must be 
at least one which is not zero. For each a, which does not vanish 


we put Bs e® and make in [13] the substitution 
[15] 7 = (C+ n)é. 
We may then write . 


G(E, [ee m)€)= EG, (E, m), 
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where G, is an integral function in £ and 7, which vanishes for 
—€=0 and 7,=0. If, for example, i=1, we have 


18] Em) = 55 am [Lee — B,)+ em) 


E 
Teapi +m) 


ak 
+ +++ +2 By (1, +m). 


From this it is evident that the position €=0, 7,=0 in the 
structure G(&, 7,) is at most a k,-ple singularity and conse- 
quently less than a k-ple, so that the problem may be regarded 
as solved, since, by hypothesis, when 4,< k we have supposed that 
we may derive all power-series which satisfy G(&, 7,)= 0. For , 
through the formula 7 = (c+ »,)& we have series arranged in 
integral or fractional positive powers of € which substituted in 
G(&, 7) cause this expression to vanish identically. Besides these 
series there are no other such series for 7 which begin with the 
term cVE, 

_If in [15] we let 7 take all the values ti a,# 0, we have 
in this way all those series for 7, where X= yp, which satisfy the 
equation G(&, »)=0. Among the quantities a, a,,---, @, there 
may be one, for example @,, which is zero. If we consider » and & 
interchanged and then make in [13] the substitution & = n&,, we 
may derive all series which proceed according to integral or frac- 
tional positive powers of 7 with constant term zero and which 
when written for & in the equation G(E, 7)= 0 identically satisfy 


be 
it, and whose initial term is dn’, where w>d. 

By reverting each of these series we may express 7 as series 
in terms of & which satisfy [13], where X< pm. 

Further, we have all such series. For if [13] was solved by 
writing for 7 a series [12], then we also eee [13] by writing 


for & a series in integral positive powers of n* whose initial term 


M . 
contains 7’, where < is an improper fraction. 
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Second case. Let ®,(E, n)=(an —B&)* and suppose first that 
a#(. We make in [13] the substitution 


n=(2+n)é (i) 


and have, after division by &, the new equation 


I 
= 


k 


If for this equation the position €=0, »,=0 is less than 
a k-ple singularity the problem is by hypothesis solved, or if it 
remains a k-ple singularity and if the polynomial of the terms of 
the Ath order in € and 7, may be decomposed into different linear 
factors; we may proceed as in the first case. It may happen, how- 
ever, that the position = 0, 7, =0 is a k-ple singularity whose 
terms again form the Ath power of a linear expression in & and 


m, which must necessarily be A (an; — B,&)*. 


If, further, we write in [17] 7, instead of ,, where 7, is defined 
by the equation 8 
= & + ns) g, 7) 
the expression will be divisible by &*, so that we may write 
(6 (E as ns) f)= EG, (&, No)» 


where G,(€, n= a) +€H,(E 75), 


H,(E, 2) being an integral function of & and y,. 
Noting (¢) and (7) it is seen that if there is for 7 a series 
of the form 


ee By a+2 
(18) ofp erage... 


then for €=0 the quantity , introduced above must be zero, 
and 7, must belong to those series that vanish with £ and which 
are obtained from the equation G,(E, 7,)= 0. 


eit i 
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This equation may be solved as above for 7, if the position 
€=0, »,=0 for the structure G(E, 72)=0 is less than a k-ple 
singularity or if it is a A-ple singularity in which the terms of 
the kth order do not constitute the Ath power of a linear func- 
tion of € and 9. We further have all series, proceeding according 
to powers of € without constant term, which when substituted in 
[13] satisfy it, if we solve the equation 


G(E, 7) = 0 


with respect to 7, in all possible ways through power-series in & 
without constant term and substitute these series for 7, in the 
expression (cf. (7) and (2)) 


=F es Pips ye 


But if the position €=0, 7,=0 is a k-ple singularity in the 
structure G,(&, 7,)=0, and if the terms of the kth order form 
the kth power of a linear expression in &, 7,, which must have 


the form 7 — ££)", we must write 7, instead of y,, where 7, 
is defined by me (@ ee ns) é, an 


and proceed in a similar manner as above. 

Continuing in this manner it is evident that if a#0 we may 
derive all power-series in € without constant term which written 
for 7 in the equation [13] identically satisfy it, if through a 
series of transformations 


[19] n=(E+m)é = (S+n)é oes, m= (7 


we may from the given equation G(&, 7)=0 derive an equation 
G,(E, ”,)= 0 whose left-hand side does not begin with the /th 
power of a linear expression in & and 7, si 

We must finally come to such an equation if F(z, y) and by 


Bie 1 


= m)é 


have no divisor in common. For, since the factor &* appears with 
each of the substitutions [19], it is easily shown that the integer 
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h in [19] cannot pass a fixed limit. For if F is of the nth degree 
in y, we may always find two integral functions U and V in wand y 
where U is at most of the (n —1)st degree in y and V at most of the 
(n — 2)d degree in y such that there exists the identical relation 


[20] VF (x, y)+ 0S D(z), 


where D(x) is an integral function in 2. 
Furthermore, since 


a(e (E+n)é)= ere», 
ci(€, (= + no)B)= E*G4(E »), 


Gina(€ (= +) 8) = BFGHE mo, 
it is seen that 
F(x) + & yo +m) = GE, 0) = EMG (E, 11). 
We also note from the formula 
Fw, y +0) = F(x, roe tien 
if we make the substitution «= + & y=y,+7, since 


F(a + ‘s Yor n)= GE, n), 


OF (: 
that G6 1+ )=G6n+|FEn| oe ere 
cy L=M+é 
Y=Yotn 
Expanding the left-hand side of this expression, it is seen that 
een) _ GE, 0) _ me 2G On _ gna On, 
OY demwte Oy On, On Onn 
Y=Yotn 


It follows that after the substitution of 
“=a+& y=y +n, where from [19] 


B, B 
[21] n= Fete +(Etet +m, er 
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the left-hand side of [20]'is seen to be divisible by &#-),. But 
on the right-hand side D(x) + &) is of the same degree d, say, in & 


as D(x) isin x It follows that h(k-—1) Sd orh= a 


If, secondly, a = 0, or ®;,(E, n) =(— BE), it is seen that through 
a corresponding change of the method given above, all series 
which proceed according to powers of 7 without constant term 
may be found which when written for € in the equation [13] 
identically satisfy this equation. Through reversion of these series 
we derive series in powers of € without constant term which satisfy 
the equation [13] with respect to 7, and in fact all such series. 

144. The following theorem is proved by Stolz (Math. Ann., 
Vol. VITI, p. 438): If x, yp is a position of the structure F(a, y) = 0 
and if this equation is brought through the substitution «= x, + &, 
¥=Y,+7 to the form [3] above, viz., 


[3] Flay + &, yo + y= & (a + EF(E)) + 072 (6 + ng (n)) 
aa En H(E, n)= 0, 


then the collectivity of the convergent series in integral positive 


1 A 
powers of & or &, viz, c€*+.---, which vanish with &, and 
when written for 7 in the equation [13] satisfy it, are charac- 
terized through 

ye = 95 >» make 


In these expressions w is the smallest of the roots of € which 


are contained to an integral power in each term of a series in 
1 


question, and A is the least exponent of & in this series. This is 
illustrated in the example of the next section. 

145. The above theorem offers a check for the determination 
of all the series which belong to a singular position of a function, 
as is illustrated in the following example. 

Example. For the algebraic structure defined through the equation 

4 x2y3 — 9 xy? + 2 xSy — Qlay> + 8y’ —102%=0 (7) 
the point z=0, y=0 is a 5-ple singularity. The terms of the fifth 


order in (i) are 42%y3 and consequently may be decomposed into the 
factors x and y. 
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Corresponding to the factor y, write in © y = xy, The result of the 
substitution is, after division by 2°, 


4y3 — 9 ay? + 2 x2y, —10 25 — 21 22y8 + 8 ay = 0. (ii) 


The point z= 0, y, = 0 is a triple singularity for this structure, the terms 
of the third order being 


4y3 — 9 xy? + 227y, =¥, (49, — 2) (y, — 27). (tit) 


Corresponding to the first factor, write in (ii) y, = xy, and divide the 
resulting equation by z®. We then have 


2y, —10 27 — 9 93 + 493 — 21 sy? + 8 ays = 0, 
where z = 0, y, = 0 is a simple point. From this equation we have 
Yo = 5 2? + 
We thus have as a solution of (2) 
y = xy, = xy, = 5 xt + terms of a higher order. (w) 


Corresponding to the second factor in (iii) write in (i) y, =x(4 + y,) 
and divide the result by z°. We then have 


— fy, ~l0et— 6 y2 +4 98 + <-- = 0 
and from this we have Yo = — AP ar 4--- 


It follows that (2) is satisfied by the series 


2) 40a 
y= (hy) = ee () 


Corresponding to the third factor of (iii), write in (i) y, =x(2+y,), 
and dividing the result by z* we have 


14y,—1027+15y2+4y8 + -- 
From this it follows that y,=?2?+---; and the corresponding value 
of y is y =Qa2+ Satpe.., (vi) 
Returning to (7) write x = yr, so that (7) becomes 
4a? + 8y? — 21y%x, — 9 yxt + 2 yah —10 21% = 0. (vit) 
For this structure y = 0, x, = 0 is a double point, the terms of the second 


der being 9 
order being fap +8 y= 4(e, + tyV2)(x, — iy V2), 


Corresponding to the factors of this expression we make in (vi) the 
substitutions 


t= y(+iv2 2+ 2,). (viit) 
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If then we divide through by 97, we have the equations 
+(8a,—2Qly)iV2—2lyz,+4a24---=0. 


From each of these equations we derive series which begin with the same 
term, viz., x, = *s+y + ---, so that we derive the two series 


a= yr, = (tiV2 + 2,) = + iy? V2 + Ad yrtees 


By inverting these series we have the series which proceed in ascending 
powers of 2%, viz., 


| = 
PN oo ae (iv) and (2) 


We have thus derived five power-series which proceed in integral or frac- 
tional powers of x without constant term which satisfy (7), viz., (i), 
(v), (v2), (tv), and (2). 

It is further seen that $u=1+4+1+1+4 2 +42 =7, which is the smallest 
exponent of the terms that are free from z, while SA=4+2+2+1+4+1=10, 
which is the smallest exponent of the terms that are free from y in (?) 
(Stolz, p. 195). 
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